Can I adapt the model for video prediction(like moving mmnist)?

Open Crestina2001 opened this issue 1 year ago • 2 comments

Thanks for the great work!

Is it possible to do adapt the model for video prediction? And if so, what decoder model shall I use? Thanks for any suggestions!

Jul 22 '24 02:07 Crestina2001

You can consider fine-tuning the stage 1 model in combination with videoMAEv2's decoder. These components closely resemble autoencoders and have the potential to predict frames. However, it's important to assess whether they align with your specific requirements.

Jul 22 '24 04:07 shepnerd

You can consider fine-tuning the stage 1 model in combination with videoMAEv2's decoder. These components closely resemble autoencoders and have the potential to predict frames. However, it's important to assess whether they align with your specific requirements.

Thanks for your suggestions! I would give it a try.

Jul 22 '24 07:07 Crestina2001