FasterViT issues

Mismatch with timm library

1

`ImportError: cannot import name '_update_default_kwargs' from 'timm.models._builder'` In timm github repo you may see recent name '_update_default_model_kwargs' https://github.com/huggingface/pytorch-image-models/blob/main/timm/models/_builder.py

MaratZakirov

Fixed issue with reneamed dependency function (timm)

1

Changed `_update_default_kwargs` to `_update_default_model_kwargs` function as the timm library changed the name

Charlotte-R-01134

ImageNet-21K checkpoint without fine-tuning!

Congrats on the acceptance and great work! Any chance you could provide an ImageNet-21K faster-vit checkpoint **without** ImageNet-1K fine-tuning. I would be very interested to see how the ImageNet-21K-based learned...

black0017

Drawing Heatmap

![image](https://github.com/NVlabs/FasterViT/assets/50725139/0e389c1e-cff3-41c1-bccd-b50dedfb8ec1) I want to apply this model for my downstream task. And it is very necessary to draw heatmap that model focus on. So, is this model possible to draw...

kojunseo

How to determine the args.mesa ratio?

First, thank you for sharing your wonderful work as an open source. I appreciate your work. I found in the TRAINING.md you provided that the args.mesa parameter settings differ for...

Kim-DKyu

Update object detection sources

1

# PR Summary Small PR - Commit 848b042a1fa3be6a4672e839f40794d127fc1308 moved object detection sources. This PR adjusts sources to changes. It also fixes a few typos along the way.

emmanuel-ferdman

PreTraining the model on image-net to reproduce results. causing errors after 1 epoc.

python train.py \ --config /content/FasterViT/fastervit/configs/faster_vit_1_224_1k.yaml \ --model faster_vit_1_224 \ --tag faster_vit_1_224_exp_1 \ --batch-size 64 \ --lr 0.005 \ --mesa 0.2 \ --model-ema \ --opt adamw \ --weight-decay 0.005 \ --amp...

muhammadumair894

[Bug]: Downstream forward_raw() methods have incorrect permutation

This issue relates to the implementation in `downstream/object_detection/dino/models/dino /fastervit.py`. I'm using this FasterViT implementation as a backbone for my own object detector, so I only needed the `forward_raw()` method. However...

collinmccarthy

[Bug]: NCCL timeout with multi-GPU validation with different image sizes per GPU

1

I'm experimenting with FasterViT in an MMDetection project. In this project the validation data augmentation pipeline does not crop the image, and simply pads it to the minimum size. This...

collinmccarthy

Torch jit bug

Can't use torch.jit.trace + torch.jit.script. Error: `RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!` Reproduce: ``` import torch import...

vodan37

FasterViT
FasterViT copied to clipboard

Metadata

Mismatch with timm library

Fixed issue with reneamed dependency function (timm)

ImageNet-21K checkpoint without fine-tuning!

Drawing Heatmap

How to determine the args.mesa ratio?

Update object detection sources

PreTraining the model on image-net to reproduce results. causing errors after 1 epoc.

[Bug]: Downstream forward_raw() methods have incorrect permutation

[Bug]: NCCL timeout with multi-GPU validation with different image sizes per GPU

Torch jit bug

← Metadata

Owner

Metadata

FasterViT FasterViT copied to clipboard

Metadata

← Metadata

Owner

Metadata

FasterViT
FasterViT copied to clipboard