Jingjing Pan
Results
2
comments of
Jingjing Pan
For question 2, is it because UMT-L + VideoChat2 + Mistral-7B=60.4 --> Result reported as MVBench top-1 InternVideo2 + VideoChat2 + Mistral-7B = 60.9 --> Result reported in InternVideo2 paper...
Thank you for the swift response! I see how the dynamic resolution setting is working for HD training. One followup question is - I saw the `blocks` is not used...