InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Results 170 InternVideo issues

I wanted to try out the distillation code, but I encountered an error and found that the corresponding file is missing from the project. Can you help me solve this problem? ![截图...

Hi, could you please share the training strategy you used for model distillation? If possible, sharing the paper you followed for distillation would be really helpful.

![Image](https://github.com/user-attachments/assets/f92e629f-afab-43bd-87e0-326379b9465b) No utils.py contains the three modules. Where can I get the code?

Dear InternVideo Group, Thank you for your wonderful work! I encountered an issue while attempting to finetune the InternVideo1-B model. According to the config instructions, I set the 'evaluation' as...

Hello, I have a question. I wanted to use the zero-shot part, but I didn't understand which model I should put in the file you mentioned in the README. I have...

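I could not find the corresponding evaluation scripts in the single_modality and multi_modality directories. How should I reproduce the accuracy of InternVideo2s1-1B on the three datasets K400, K600, and K700?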

I didn't find it in the downstream part.

Thanks for your great work! I want to use multiple GPUs to run inference on videos. Is there a script or an example for this?

Dear team, Thanks for your great job. I would like to know how to get the text feature for the grounding task. I see that you utilize the LLAMA backbone...