Muhammad Umair
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results. Setting `pad_token_id`...
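To illustrate what the warning above is asking for, here is a minimal sketch in plain Python of how padded inputs and their attention mask relate. The helper name `pad_with_mask` and the sample token ids are hypothetical, chosen only for illustration. In practice a Hugging Face tokenizer normally returns `attention_mask` alongside `input_ids`.

```python
from typing import List, Tuple


def pad_with_mask(
    sequences: List[List[int]], pad_token_id: int
) -> Tuple[List[List[int]], List[List[int]]]:
    """Right-pad token id sequences and build the matching attention mask.

    Hypothetical helper for illustration only: 1 marks a real token,
    0 marks padding that the model should ignore.
    """
    max_len = max((len(seq) for seq in sequences), default=0)
    input_ids, attention_mask = [], []
    for seq in sequences:
        n_pad = max_len - len(seq)
        input_ids.append(seq + [pad_token_id] * n_pad)
        attention_mask.append([1] * len(seq) + [0] * n_pad)
    return input_ids, attention_mask


# Made-up token ids; pad_token_id=0 is an assumption for this example.
ids, mask = pad_with_mask([[5, 6, 7], [8]], pad_token_id=0)
print(ids)   # [[5, 6, 7], [8, 0, 0]]
print(mask)  # [[1, 1, 1], [1, 0, 0]]
```

With the `transformers` library, the usual way to avoid the warning is to pass both values through to generation, for example `model.generate(**inputs, pad_token_id=tokenizer.eos_token_id)`, where `inputs` is the tokenizer output and already contains `attention_mask`. Whether `eos_token_id` is a suitable pad token depends on the model.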
python train.py \
  --config /content/FasterViT/fastervit/configs/faster_vit_1_224_1k.yaml \
  --model faster_vit_1_224 \
  --tag faster_vit_1_224_exp_1 \
  --batch-size 64 \
  --lr 0.005 \
  --mesa 0.2 \
  --model-ema \
  --opt adamw \
  --weight-decay 0.005 \
  --amp...
First of all, great work, I truly appreciate the effort. I’m looking forward to using this in a research use case and will be sure to give credit. Could you...