MM-NIAH
MM-NIAH copied to clipboard
This is the official implementation of the paper "Needle In A Multimodal Haystack"
Results
2
MM-NIAH issues
Sort by
recently updated
recently updated
newest added
The reasoning-image task only have two choices, so "random choose" can get 50 scores. However, performance of the best model "InternVL-Chat-V1-5-RAG" is  Does it means this task is too...
Model evaluation of MM-NIAH using LMDeploy