LM4VisualEncoding icon indicating copy to clipboard operation
LM4VisualEncoding copied to clipboard

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Results 3 LM4VisualEncoding issues
Sort by recently updated
recently updated
newest added

Hope to access the 2D VQA and Image-Text Retrieval Task

Thanks for your excellent work, I am wondering when the code for motion forecasting will be released?

Thank you for your insightful discovery. I have a question regarding the influence of ViT. If you use a pre-trained ViT and freeze it, then only train the added adapter...