GVT

Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".

5 GVT issues

Thanks to the authors for the insightful work! I would like to understand more about tuning the visual tokenizer. Would you mind explaining what kind of dataset was used in training your...

Good job on the really insightful and useful paper! The quantitative metrics are really useful when working with these models. **Question:** When evaluating the LLaVA baseline (Table 4 in the paper),...

transformers==4.28.0 and salesforce-lavis 1.0.2 conflict. Can you provide the version of lavis you used?

Hello there, great work! Similar to [this issue](https://github.com/TencentARC/GVT/issues/2), how do [LLaMA-Adapter](https://github.com/ZrrSkywalker/LLaMA-Adapter) V1 & V2 and InstructBLIP (available in LAVIS) perform? Thanks!

Hey, do you have a timeline for when you will release the code and pre-trained models?