How do you process non-square images passed to CLIP when doing inference?

Open RedRAINXXXX opened this issue 2 years ago • 0 comments

You use square patches during training, but how do you handle non-square images at inference time? Do you resize them directly to (224, 224), ignoring the aspect ratio? Looking forward to your reply ;)
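
To make the question concrete, here is a minimal sketch of the two options I can think of. It only computes the geometry and does not touch any pixels. The function names `squash_size` and `resize_then_center_crop` are hypothetical and are not taken from this repository; the second option (scale the shorter side, then center-crop) is just a common alternative to direct resizing, and I am not assuming it is what your code does.

```python
def squash_size(width, height, target=224):
    """Option A: resize straight to (target, target).

    The aspect ratio is not preserved, so a non-square image is stretched.
    """
    return (target, target)


def resize_then_center_crop(width, height, target=224):
    """Option B: scale the shorter side to `target`, then center-crop.

    Returns (resized_size, crop_box), where crop_box is
    (left, top, right, bottom) in the resized image's coordinates.
    The aspect ratio is preserved, but the borders of the longer side are cut off.
    """
    scale = target / min(width, height)
    # Guard against rounding pushing either side below the crop size.
    new_w = max(target, round(width * scale))
    new_h = max(target, round(height * scale))
    left = (new_w - target) // 2
    top = (new_h - target) // 2
    return (new_w, new_h), (left, top, left + target, top + target)


if __name__ == "__main__":
    # A 640x480 landscape image as an example input.
    print(squash_size(640, 480))
    print(resize_then_center_crop(640, 480))
```

With option A the whole image is kept but distorted; with option B nothing is distorted but part of the image is dropped. Which of these (if either) matches what you do at inference?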

Dec 08 '23 03:12 RedRAINXXXX