VLMEvalKit icon indicating copy to clipboard operation
VLMEvalKit copied to clipboard

Why resize images in MMMU for DeepSeek-VL2?

Open kydxh opened this issue 10 months ago • 1 comments

I notice that in evaluation of DeepSeek-VL2 on MMMU, the code resize the first image. But I don't know why. Could you please tell me the reasons?

Image

(codes from "generate_inner" in VLMEvalKit/vlmeval/vlm/deepseek_vl2.py)

kydxh avatar Mar 20 '25 10:03 kydxh

The corresponding Pull Request was created by a member from the official DeepSeekVL2 team. We may need to seek help from the PR author @gnobitab

kennymckormick avatar Apr 07 '25 13:04 kennymckormick

Thank you very much.

kydxh avatar May 07 '25 09:05 kydxh