EVA
Difference between eva-clip model and MIM pretrained model
I find that the eva-clip model has an extra inner_attn_ln layer compared to the original pretrained model.
Any reason to use another layer-norm in the clip model?
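For anyone who wants to reproduce the comparison, here is a minimal sketch of how the extra layer can be spotted by diffing parameter names. It assumes both checkpoints have already been loaded as name-to-tensor mappings; the helper `diff_layer_names` and the key names below are hypothetical placeholders for illustration, not copied from the actual EVA checkpoints.

```python
def diff_layer_names(clip_keys, mim_keys, marker="inner_attn_ln"):
    """Return names found only in clip_keys that contain `marker`, sorted."""
    only_in_clip = sorted(set(clip_keys) - set(mim_keys))
    return [name for name in only_in_clip if marker in name]


# Hypothetical parameter names standing in for the two state dicts.
mim_keys = [
    "blocks.0.attn.q_proj.weight",
    "blocks.0.attn.proj.weight",
]
clip_keys = mim_keys + [
    "blocks.0.attn.inner_attn_ln.weight",
    "blocks.0.attn.inner_attn_ln.bias",
]

extra = diff_layer_names(clip_keys, mim_keys)
print(extra)
```

With real checkpoints, the two lists would be replaced by the keys of each loaded state dict.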
Hello, I have received your email and will reply as soon as possible.
Have you gotten an answer yet? I have the same question.