LM4VisualEncoding

ziqipang

Metadata

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Issues

3 LM4VisualEncoding issues

2D VQA and Image-Text Retrieval

3

Hoping to get access to the 2D VQA and Image-Text Retrieval tasks.

xvolica

About Motion Forecasting

Thanks for your excellent work. I am wondering when the code for motion forecasting will be released.

Zbozhou

Influence of ViT

1

Thank you for your insightful discovery. I have a question regarding the influence of ViT. If you use a pre-trained ViT and freeze it, then only train the added adapter...

jiazhen-code

About

vision-transformer

llm

Stars: 194

Forks: 5

Owner: ziqipang
