VLMEvalKit icon indicating copy to clipboard operation
VLMEvalKit copied to clipboard

Error in analyzing video datasets with Qwen2.5-VL-3B-Instruct

Open DanialTaheri opened this issue 9 months ago • 0 comments

I am running the below command python run.py --data Video-MME_8frame --model Qwen2.5-VL-3B-Instruct but I am getting the error: File "VLMEvalKit_original/VLMEvalKit/vlmeval/vlm/qwen2_vl/model.py", line 192, in generate_inner images, videos = process_vision_info([messages]) File "/usr/local/lib/python3.10/dist-packages/qwen_vl_utils/vision_process.py", line 332, in process_vision_info video_inputs.append(fetch_video(vision_info)) File "/usr/local/lib/python3.10/dist-packages/qwen_vl_utils/vision_process.py", line 260, in fetch_video video = VIDEO_READER_BACKENDSvideo_reader_backend File "/usr/local/lib/python3.10/dist-packages/qwen_vl_utils/vision_process.py", line 230, in _read_video_decord nframes = smart_nframes(ele, total_frames=total_frames, video_fps=video_fps) File "/usr/local/lib/python3.10/dist-packages/qwen_vl_utils/vision_process.py", line 161, in smart_nframes raise ValueError(f"nframes should in interval [{FRAME_FACTOR}, {total_frames}], but got {nframes}.") ValueError: nframes should in interval [2, 2227], but got 0.

I am installing transformers==4.49.0 and qwen-vl-utils[decord]==0.0.8. Are there any issues with my packages?

DanialTaheri avatar Apr 16 '25 21:04 DanialTaheri