Dinosaurcubs

Results: 3 comments by Dinosaurcubs

> You don't need to do
>
>     image_features /= image_features.norm(dim=-1, keepdim=True)
>     text_features /= text_features.norm(dim=-1, keepdim=True)
>
> if you're using cosine_similarity. torch.cosine_similarity(x, y) already normalizes the inputs, by nature...
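
The point in the comment above can be checked with a small sketch. This is a plain-Python stand-in, not PyTorch's implementation: the helper names `l2_normalize` and `cosine_similarity`, the `eps` guard, and the sample vectors are all assumptions made for illustration. It only shows that cosine similarity divides by the vector norms itself, so normalizing the inputs beforehand does not change the result.

```python
import math

def l2_normalize(v):
    # Hypothetical helper: scale a vector to unit L2 norm.
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def cosine_similarity(x, y, eps=1e-8):
    # Illustrative cosine similarity: dot product divided by the
    # product of the norms (eps guards against division by zero).
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / max(norm_x * norm_y, eps)

# Made-up feature vectors standing in for image/text embeddings.
image_features = [3.0, 4.0, 0.0]
text_features = [1.0, 2.0, 2.0]

# Similarity on the raw vectors.
raw = cosine_similarity(image_features, text_features)

# Similarity after explicitly normalizing both inputs first.
pre_normalized = cosine_similarity(
    l2_normalize(image_features), l2_normalize(text_features)
)

# Both values agree up to floating-point rounding.
print(raw, pre_normalized)
```

The explicit normalization is still needed if the features are later combined with a plain dot product or matrix multiply instead of a cosine-similarity call.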

> Found the solution. My problem was the size of the images: I had batches of dimension (16, 3, 32, 32) (16 images per batch, 3 channels, 32 height/width). Got...