InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Results 461 InternVL issues
Sort by recently updated
recently updated
newest added

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

作者您好,我的任务需要模型同时进行ocr+定位,当我设置较大的学习率(4e-5)的时候ocr会出现很多错别字但定位能力能学的比较好,当我设置较小的学习率(1e-5/1e-6等)的时候ocr不会有错别字但定位能力难以拟合,请问有什么其他的方法来平衡这两个能力吗

Hello, I have some questions about the leaderboard evaluation results. The leaderboard shows that the 1B model mataVista has a score of 45.8, but my actual measurement using VLMEvalKit is...

## Motivation Support V2PE in pre-training, fine-tuning, and inference for InternVL. ## Modification ### V2PE utils Added the file `internvl_chat/internvl/v2pe_utils.py`. It includes the `get_rope_pos_id` function, which calculates position ids required...

### Motivation Hi, thanks for your great work on this project! I’m very interested in the dataset you used and would like to know when you are planning to release...

“Regarding the phrase ‘For instructions with clear ground truths’ mentioned in Section 3.1 of the article, I would like to know how the author evaluates whether the generated responses match...

thanks for your great work. I hope to use vllm to speed up the VisualPRM-8B. Does it support vllm?

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [X] 2. The bug has not been fixed in the latest version. -...

### Motivation 请问最新发布的internvl3怎么做目标检测,能给一个例子吗 ### Related resources _No response_ ### Additional context _No response_