InternVideo issues

Evaluation of Finetuned Model on SthV2 dataset Got Extremely Low Performance

1

Thank you for your great work! I downloaded the finetuned model provided in your model zoo: https://huggingface.co/OpenGVLab/InternVideo2-Stage1-1B-224p-f8-thSth/blob/main/1B_ft_ssv2_f8.pth (with 77.1% topp-1 accuracy reported on SthV2) and prepared the dataset SthV2 according...

caidonkey

Attempted relative import with no known parent package

1

When I run this demo.ipynb, I encounter the following import error. Theoretically, Python should be able to successfully import from the current directory using ..utils.easydict. Can you explain why this...

kaizhuanren

有兴趣可以加一下多模态群，大家一起交流多模态技术实战中遇到的问题

如果二维码失效，可以加微信拉入群：yx116169 ![20240716-114139](https://github.com/user-attachments/assets/7b465095-9367-4e2d-bac8-f9eea0a0b01e)

feihuamantian

Not able to reproduce the effects of finetune InterVideo1

1

Hellow , nice job ! I can not reproduce the MSRVTT finetuned model,and I set each args as the [log](https://pjlab-gvm-data.oss-cn-shanghai.aliyuncs.com/internvideo/retrieval/msrvtt/kc4_finetune_1e-32e-3_77words_12frames_128_16_bothdsl/log.txt) Also I check each problems ,such as dataloade or the...

hardlipay

Details about generating video captions for InterVid

1

Dear authors, Can you share some details about how we can generate the captions for new videos in the same manner as done for Intervid? From the paper, you generated...

fmthoker

Download scripts for InternVid-Aesthetics-18M dataset

1

Thank you for your incredible works！ I would like to use the InternVid-Aesthetics-18M dataset to train some video generation models, but didn't find any available instruction or documentation of how...

SystemErrorWang

Training and Evaluation Code for ViClip

11

Dear authors, Great work and thanks for releasing the code for ViClip pretraining on InternVid-10M-FLT. Firstly, It would be really great if the pre-trainning instructions are more detailed, like which...

fmthoker

Request for finetuned InternVideo2-1B results on video retrieval benchmarks

18

Hi, great work and thanks for releasing the code. In Table 10 of your InternVideo2 paper, you reported the results of finetuning video retrieval in both T2V and V2T on...

roberto-amoroso

Can't Reproduce Zero Shot Performance MSRVTT and LSMDC with Intervid-10m-FLT Checkpoint

2

Dear Authors, I am trying to reproduce Zeroshot performance with the checkpoint [ViCLIP-L-14 InternVid-10M-FLT ](https://huggingface.co/OpenGVLab/ViCLIP). However, the performance is different from reported numbers in the paper. Here are the results...

fmthoker

Question regarding InternVideo2+VideoChat2 results on MVBench

3

Hello authors, Table 16 in the InternVideo2 paper reports a score of 60.9 on MVBench, with VideoChat2 + InternVideo2s3-1B + Mistral-7B. However, the highest score on MVBench leaderboard is 60.4...

jpan72

InternVideo
InternVideo copied to clipboard

Metadata

Evaluation of Finetuned Model on SthV2 dataset Got Extremely Low Performance

Attempted relative import with no known parent package

有兴趣可以加一下多模态群，大家一起交流多模态技术实战中遇到的问题

Not able to reproduce the effects of finetune InterVideo1

Details about generating video captions for InterVid

Download scripts for InternVid-Aesthetics-18M dataset

Training and Evaluation Code for ViClip

Request for finetuned InternVideo2-1B results on video retrieval benchmarks

Can't Reproduce Zero Shot Performance MSRVTT and LSMDC with Intervid-10m-FLT Checkpoint

Question regarding InternVideo2+VideoChat2 results on MVBench

← Metadata

Owner

Metadata

InternVideo InternVideo copied to clipboard

Metadata

← Metadata

Owner

Metadata

InternVideo
InternVideo copied to clipboard