lll2343 comments

Results: 17 comments of lll2343

[Feature] How to finetune with deepspeed for larger parameters?

Hi, you can use the same script as for smaller models, but enable DeepSpeed's ZeRO-2 or ZeRO-3 to save memory.
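As a rough illustration of the ZeRO suggestion above, here is a minimal sketch that builds a DeepSpeed config for stage 2 or 3 and writes it to JSON. It assumes the fine-tuning script goes through the Hugging Face Trainer's DeepSpeed integration (which is what makes the `"auto"` values valid) and accepts the config path through a `--deepspeed` argument. The helper names `make_zero_config` and `write_zero_config` are hypothetical, and the repository may already ship its own config files that should be preferred.

```python
import json


def make_zero_config(stage: int) -> dict:
    """Build a minimal DeepSpeed config dict for ZeRO stage 2 or 3 (illustrative only)."""
    if stage not in (2, 3):
        raise ValueError("this sketch only covers ZeRO stage 2 or 3")
    config = {
        # "auto" is resolved by the Hugging Face Trainer from its own arguments;
        # plain DeepSpeed without that integration needs concrete numbers here.
        "train_micro_batch_size_per_gpu": "auto",
        "gradient_accumulation_steps": "auto",
        "gradient_clipping": "auto",
        "bf16": {"enabled": "auto"},
        "zero_optimization": {
            # Stage 2 shards optimizer states and gradients;
            # stage 3 additionally shards the model parameters.
            "stage": stage,
            "overlap_comm": True,
            "contiguous_gradients": True,
        },
    }
    if stage == 3:
        # Gather the sharded 16-bit weights so a regular checkpoint can be saved.
        config["zero_optimization"]["stage3_gather_16bit_weights_on_model_save"] = True
    return config


def write_zero_config(path: str, stage: int) -> None:
    """Write the config to `path` (hypothetical helper)."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(make_zero_config(stage), f, indent=2)
```

Stage 3 generally saves more memory than stage 2 at the cost of extra communication, so it is usually worth trying stage 2 first and moving to stage 3 only if the model still does not fit.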

[Bug] [Errno 2] No such file or directory eval/mmmu/evaluate_mmmu_cot.py

Hi, see [#739](https://github.com/OpenGVLab/InternVL/issues/739).

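[Docs] How should I train after adding a new fully connected layer before the mlp1 layer?

Could you describe your requirements in more detail?

[Docs] How should I train after adding a new fully connected layer before the mlp1 layer?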

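Hi,

1. **Modifying the MLP cannot reduce `num_image_token`.** `pixel_shuffle` already reduces `num_image_token`; see the [`extract_feature` function](https://github.com/OpenGVLab/InternVL/blob/main/internvl_chat/internvl/model/internvl_chat/modeling_internvl_chat.py#272). If the input image resolution is high, you can reduce the number of tiles the image is split into by adjusting `max_dynamic_patch`.
2. **Increase the number of training steps.** Modify the `repeat_time` parameter in [meta.json](https://github.com/OpenGVLab/InternVL/blob/main/internvl_chat/shell/data/coco_caption.json#L6) so that training runs for more steps on that dataset, which should improve performance.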
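The two points above can be sketched with some back-of-the-envelope arithmetic. The defaults below (448-pixel tiles, 14-pixel ViT patches, a `pixel_shuffle` downsample ratio of 0.5, and one extra thumbnail tile whenever an image is split into more than one tile) are assumptions about a typical InternVL2 configuration, not values taken from this thread. The function names and the `my_dataset` entry are hypothetical.

```python
import copy


def tokens_per_tile(image_size: int = 448, patch_size: int = 14,
                    downsample_ratio: float = 0.5) -> int:
    """Visual tokens per tile after pixel_shuffle (assumed defaults)."""
    patches_per_side = image_size // patch_size
    # pixel_shuffle scales each spatial side by downsample_ratio,
    # so the token count shrinks by downsample_ratio squared.
    return int(patches_per_side ** 2 * downsample_ratio ** 2)


def total_image_tokens(num_tiles: int, use_thumbnail: bool = True) -> int:
    """Total visual tokens for one image split into `num_tiles` tiles."""
    # Assumption: a thumbnail tile is appended only when there is more than one tile.
    extra = 1 if use_thumbnail and num_tiles > 1 else 0
    return (num_tiles + extra) * tokens_per_tile()


def with_repeat_time(meta: dict, dataset: str, repeat_time: float) -> dict:
    """Return a copy of a meta.json-style dict with `repeat_time` changed."""
    updated = copy.deepcopy(meta)
    updated[dataset]["repeat_time"] = repeat_time
    return updated


# Lowering max_dynamic_patch caps the tile count, which caps the token count.
print(total_image_tokens(12))  # 13 tiles * 256 tokens = 3328
print(total_image_tokens(6))   # 7 tiles * 256 tokens = 1792

# Hypothetical meta entry; only `repeat_time` is the field discussed above.
meta = {"my_dataset": {"repeat_time": 1}}
print(with_repeat_time(meta, "my_dataset", 3))
```

Under these assumptions, halving `max_dynamic_patch` roughly halves the worst-case number of visual tokens per image, while the per-tile count of 256 stays fixed regardless of what is changed in the MLP.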

lll2343

[Feature] How to finetune with deepspeed for larger parameters?

How to load and use the fine-tuned InternVL2-1B model

[Bug] [Errno 2] No such file or directory eval/mmmu/evaluate_mmmu_cot.py

A question about InternVL when localizing multiple categories

[Docs]

[Docs] Path issue

[Docs] Path issue

🔥🔥🔥 Local deployment + real-world evaluation video

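[Docs] How should I train after adding a new fully connected layer before the mlp1 layer?

[Docs] How should I train after adding a new fully connected layer before the mlp1 layer?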