VLMEvalKit
VLMEvalKit copied to clipboard
LLaVA v1.5 7b very low performance on TextVQA
Dear authors,
Thanks for sharing this great codebase.
I tried to test LLaVA-v1.5-7b (llava_v1.5_7b) model on TextVQA_VAL and only get about 21.88 accuracy, which is much lower than the 58.2 performance reported in the paper. I understand that the codebase is not aimed to reproduce the exact results but it seems that the gap is too big.
Is there any reason why this happens and any quick fix?
Thanks very much!
Same to you, did you solve it?
Same to you, did you solve it?
No I haven't figured it out. 🥲
I've also got the same problem.