InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Results 170 InternVideo issues

Hi! I have tried Internvideo2-1B-clip on the action recognition task on the K400 dataset. I am trying to use the model without the dataset class you designed. So what I do in...

We are trying to use the InternVideo2 multi_modality model, but right now it's quite painful: we have to git clone the repo, fix the imports, and use a custom `pyproject.toml`...

Is there any timeline for when the 6B Stage-2 pretrained models will be released on Huggingface? In the model zoo (https://github.com/OpenGVLab/InternVideo/blob/main/InternVideo2/multi_modality/MODEL_ZOO.md), it only says 'TBD'. Thanks!

Thank you for the great work! I am currently working on temporal action localization and planning to use InternVideo2-1B and 6B for feature extraction from raw video data that is not...

Hi, I wonder whether all videos need to be the same length during training. At inference time, how can you run inference if the video is longer or shorter...

Hello, Firstly, I love the work that you guys have done here, absolutely incredible. We are currently working on patient monitoring within the NICU and we're wondering what model to...

This fixes part of the relative import package problem...

Add information about the small `clip` models in `README.md`. Hopefully this makes it clearer what these files actually do. Edit: I also fixed a link in the InternVideo2 Readme.

Hello, First, I want to thank the authors for this work. I think I saw your poster at ICLR in May. Back then I did not work on anything related...

Hi author, I have compiled and installed fused_dense_lib successfully. But when I tried to run the finetuning code, I encountered this error: "RuntimeError: linear_act_forward failed." which is due to the...