efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Hi, first of all, great work. I was wondering if you have any advice on how to fine-tune these models on a custom dataset (COCO format)? Best regards,
There is an issue when running the SAM model on macOS where boxes return really poor output. This is using MPS / Metal acceleration with PYTORCH_ENABLE_MPS_FALLBACK=1 due to this...
Hi, the ONNX export only supports point prompts and box prompts. How can I segment everything with ONNX Runtime or TensorRT? By the way, is there a plan to add a C++ inference demo? Thank...
How can I train the image encoder of EfficientViT-SAM using SAM's image encoder as the teacher, and where is the code located?
What is the difference between EfficientViT and EfficientSAM? Thanks!
```
def relu_linear_att(self, qkv: torch.Tensor) -> torch.Tensor:
    B, _, H, W = list(qkv.size())

    if qkv.dtype == torch.float16:
        qkv = qkv.float()

    qkv = torch.reshape(
        qkv,
        (
            B,
            -1,
            3 * self.dim,...
```
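For readers unfamiliar with the function quoted above, here is a minimal, dependency-free sketch of the general idea behind ReLU linear attention for a single head. It is not the repository's implementation: the function names, the plain-list data layout, and the `eps` value are all assumptions made for illustration. The point it tries to show is that applying ReLU to queries and keys lets the key-value product be accumulated once, so the cost grows linearly with the number of tokens, while the result matches the quadratic formulation. The float16-to-float32 cast in the quoted snippet is presumably there because these accumulated sums and the final division are sensitive to half-precision range.

```python
# Hypothetical sketch of ReLU linear attention for one head (pure Python).
# Names, data layout, and eps are illustrative assumptions, not the repo's code.

def relu(x):
    return x if x > 0.0 else 0.0


def relu_linear_attention(q, k, v, eps=1e-15):
    """q, k: N tokens x d floats; v: N tokens x d_v floats (lists of lists)."""
    d, d_v = len(k[0]), len(v[0])
    q = [[relu(x) for x in row] for row in q]
    k = [[relu(x) for x in row] for row in k]

    # Accumulate K^T V (d x d_v) and the sum of keys (d) in one pass over tokens.
    kv = [[0.0] * d_v for _ in range(d)]
    k_sum = [0.0] * d
    for k_j, v_j in zip(k, v):
        for a in range(d):
            k_sum[a] += k_j[a]
            for b in range(d_v):
                kv[a][b] += k_j[a] * v_j[b]

    # Each query reuses the shared accumulators, so no N x N matrix is built.
    out = []
    for q_i in q:
        norm = sum(q_i[a] * k_sum[a] for a in range(d)) + eps
        out.append(
            [sum(q_i[a] * kv[a][b] for a in range(d)) / norm for b in range(d_v)]
        )
    return out


def relu_attention_quadratic(q, k, v, eps=1e-15):
    """Reference version that computes all N x N query-key similarities."""
    q = [[relu(x) for x in row] for row in q]
    k = [[relu(x) for x in row] for row in k]
    out = []
    for q_i in q:
        sims = [sum(a * b for a, b in zip(q_i, k_j)) for k_j in k]
        norm = sum(sims) + eps
        out.append(
            [sum(s * v_j[b] for s, v_j in zip(sims, v)) / norm
             for b in range(len(v[0]))]
        )
    return out
```

Both functions should agree up to floating-point rounding; the linear version simply reorders the multiplications so that the token dimension is summed out before the queries are applied.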
Hello, I recently read the paper titled "Condition-Aware Neural Network for Controlled Image Generation," which mentioned this repository as a reference. Could you please provide an update on when the...
When will the training code of EfficientViT-SAM be released?