InternVideo
InternVideo copied to clipboard
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
I wanted to try out the distillation code, but encountered an error and found that the corresponding file is missing in the project.Can you help me solve this problem?  no utils.py contains the three modules, where can I get the codes?
Dear InternVideo Group, Thank you for your wonderful work! I encountered an issue while attempting to finetune the InternVideo1-B model. According to the config instructions, I set the 'evaluation' as...
Hello i have a question. i wanted to use zeroshot part. but i didn't understand which model i should put in the file you said in ReadMe part. i have...
我在 single_modality 和 multi_modality 目录下没有找到对应的评测脚本,我应该如何复现 InternVideo2s1-1B 在 K400 K600 K700 三个数据集上的精度?
i didn't find it in the downstream part .
Thank for your great work! I want use mutli-gpu to infer videos. Is there a script or an example for this?
Dear team, Thanks for your great job. I would like to know how to get the text feature for the grounding task. I see that you utilize the LLAMA backbone...