VideoMAEv2
ViT_B, ViT_S pretrained weights (not the HuggingFace inference ones, but including the decoder)
Thank you very much for this awesome repo!
Could you please provide the ViT_B and ViT_S pretrained .pth weights?
On HuggingFace there is only a model.safetensors file, which covers the encoder part only. It would be awesome to have the decoder weights as well.
Any update on the availability of the ViT_B and ViT_S weights for the encoder-decoder pre-trained version?