ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

11 ViViT-pytorch issues

I have been trying to use TimeSformer and ViViT. I have managed to convert it into a regression model by changing the loss function and setting the output of the...

Hi, thanks for your work making a PyTorch version of the paper - much appreciated! How does this implementation compare to the results in the original paper? Specifically on the Moments...

Just wondering, how would you implement masking in ViViT, in case we want to process videos with different lengths? Any suggestions?
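
One possible approach, offered as a minimal sketch rather than anything taken from this repository: pad shorter clips to a common number of frames, build a boolean padding mask, and pass it to the temporal attention so padded frames are ignored as keys. The names `pad_clips` and `MaskedTemporalAttention` are hypothetical, the per-frame tokens are assumed to be already embedded with shape (T, D), and mean pooling over valid frames is an assumption (the model may use a class token instead).

```python
import torch
import torch.nn as nn


def pad_clips(clips):
    """Pad a list of (T_i, D) frame-token tensors to a common length.

    Returns the padded batch (B, T_max, D) and a boolean mask (B, T_max)
    that is True at padded positions.
    """
    lengths = torch.tensor([c.shape[0] for c in clips])
    batch = nn.utils.rnn.pad_sequence(clips, batch_first=True)
    positions = torch.arange(batch.shape[1]).unsqueeze(0)  # (1, T_max)
    pad_mask = positions >= lengths.unsqueeze(1)           # (B, T_max)
    return batch, pad_mask


class MaskedTemporalAttention(nn.Module):
    """Hypothetical temporal block: self-attention that skips padded frames."""

    def __init__(self, dim=64, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x, pad_mask):
        # key_padding_mask=True marks keys that attention must ignore.
        out, _ = self.attn(x, x, x, key_padding_mask=pad_mask)
        # Average only over real frames so padded outputs do not leak in.
        keep = (~pad_mask).unsqueeze(-1).to(out.dtype)
        return (out * keep).sum(dim=1) / keep.sum(dim=1)


torch.manual_seed(0)
clip_a = torch.randn(5, 64)  # 5 frames
clip_b = torch.randn(3, 64)  # 3 frames, padded to 5 below
batch, pad_mask = pad_clips([clip_a, clip_b])
model = MaskedTemporalAttention()
pooled = model(batch, pad_mask)  # (2, 64), one vector per clip
```

With this mask, the pooled output for the shorter clip should match what the same block produces for that clip alone, up to floating-point tolerance. Positional embeddings would also need to cover the longest clip length.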

Hi, thank you for your open-source work. Of the attention, reattention, and lcattention modules in the module file, is only one of the three required?

Thanks for the great work on ViViT model 2. Is the value of num_frame fixed or not? Or does the model process each frame one by one?

Is there any pretrained model for the Kinetics, Something Something v2, or EPIC-KITCHENS-100 datasets?

Hi. Thank you for your work. I have a problem because I don't really know how to make use of the files you uploaded to this repository. I would like...

Hi, I am a newbie to PyTorch. I want to use this model for my thesis. Can you please help me by explaining how to run this code on my...

If anyone could provide more information about the model parameters that might help me fix the problem, I would be thankful, because using the default...