How to implement VideoChat2_text

Open • qtli opened this issue 1 year ago • 1 comment

Thanks for your high-quality work. Could you please provide the blank video or the code you utilized to generate the ablation model VideoChat2_text? Thanks very much!

Apr 29 '24 15:04 qtli

Hi! For VideoChat2_text, we simply feed the model an all-zero video tensor in place of the real video, e.g. torch.zeros_like(video_emb).

May 20 '24 03:05 Andy1621
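
For readers who want to try this, here is a minimal sketch of the zero-tensor idea from the reply above. It is not the authors' actual evaluation code. The name video_emb is taken from the reply, but the thread does not say what shape it has or where in the VideoChat2 pipeline it is replaced, so the shape below and the helper name blank_video_like are assumptions for illustration only.

```python
import torch


def blank_video_like(video_emb: torch.Tensor) -> torch.Tensor:
    """Return an all-zero tensor matching video_emb in shape, dtype, and device.

    Hypothetical helper: it only wraps torch.zeros_like, as suggested in the reply.
    """
    return torch.zeros_like(video_emb)


# Assumed stand-in for a real video input: (batch, frames, channels, height, width).
# The true shape used by VideoChat2 is not given in this thread.
video_emb = torch.randn(1, 8, 3, 224, 224)

# Replace the real video with a blank one before passing it to the model.
blank = blank_video_like(video_emb)

print(blank.shape)                        # same shape as video_emb
print(torch.count_nonzero(blank).item())  # number of non-zero entries
```

Because torch.zeros_like copies the shape, dtype, and device of its argument, the blank tensor can be passed wherever the real video tensor was expected, without further changes to the surrounding code.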