InternVideo icon indicating copy to clipboard operation
InternVideo copied to clipboard

Can I adapt the model for video prediction(like moving mmnist)?

Open Crestina2001 opened this issue 1 year ago • 2 comments

Thanks for the great work!

Is it possible to do adapt the model for video prediction? And if so, what decoder model shall I use? Thanks for any suggestions!

Crestina2001 avatar Jul 22 '24 02:07 Crestina2001

You can consider fine-tuning the stage 1 model in combination with videoMAEv2's decoder. These components closely resemble autoencoders and have the potential to predict frames. However, it's important to assess whether they align with your specific requirements.

shepnerd avatar Jul 22 '24 04:07 shepnerd

You can consider fine-tuning the stage 1 model in combination with videoMAEv2's decoder. These components closely resemble autoencoders and have the potential to predict frames. However, it's important to assess whether they align with your specific requirements.

Thanks for your suggestions! I would give it a try.

Crestina2001 avatar Jul 22 '24 07:07 Crestina2001