VLMEvalKit
VLMEvalKit copied to clipboard
Inputs without JPEG conversion
Hello, I've notice that the input images used in VLMEvalKit are converted to JPEG format. I wonder if there is version that use the original images instead of JPEG format as input?
Hi @kydxh, we do not provide evaluation sets using original images, but the image quality before and after conversion should not change much, and the impact on the evaluation results is quite small. Which dataset do you want to evaluate?
Thank you. I want to evaluate some VQA datasets, such as ScienceQA, MMMU, Mathvista and TextVQA. Besides, I want to ask that, if the bugs in evaluating TextVQA dataset mentioned before has been modified?