CLIP icon indicating copy to clipboard operation
CLIP copied to clipboard

How to change the features obtained by the clip encoder[1, 512]

Open lwtgithublwt opened this issue 1 year ago • 0 comments

What exactly does the [1,512] feature obtained by the clip encoder mean, and how does it become a lattice of channels, length, and width?

lwtgithublwt avatar May 07 '24 06:05 lwtgithublwt