cilin yan
cilin yan
Thank you so much for considering my request amidst your busy schedule with the ICML deadline. Wishing you all the best for the conference.
> You can try to use multiple gpus to run! And the error will go away! One simple approach is to ensure that only one video is trained on each...
Could you please provide the estimated training time for CLIPSelf on the COCO and CC3M datasets? Thanks!
Thank you for your reply! > The training of EVA-CLIP models is fast. With 8 A100-80Gs, it takes about 2 hours to train a ViT-B/16 on COCO for 6 epochs...
After testing, the text prompt "a person" now works for human segmentation (where "person" or "human" alone failed earlier)!