Why resize images in MMMU for DeepSeek-VL2?

Open kydxh opened this issue 10 months ago • 1 comments

I noticed that when evaluating DeepSeek-VL2 on MMMU, the code resizes the first image, but I don't understand why. Could you please explain the reason?

(code from "generate_inner" in VLMEvalKit/vlmeval/vlm/deepseek_vl2.py)

Mar 20 '25 10:03 kydxh

The corresponding pull request was created by a member of the official DeepSeek-VL2 team. We may need to ask the PR author, @gnobitab, for help.

Apr 07 '25 13:04 kennymckormick

Thank you very much.

May 07 '25 09:05 kydxh