LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"
Results: 3
LM4VisualEncoding issues
Hope to access the 2D VQA and Image-Text Retrieval Task
Thanks for your excellent work. When will the code for motion forecasting be released?
Thank you for your insightful discovery. I have a question regarding the influence of ViT. If you use a pre-trained ViT and freeze it, then only train the added adapter...