zxyonaroll
> Thanks for your question. Unlike other modalities, Vision logits are not scaled by a temperature:
>
> https://github.com/facebookresearch/ImageBind/blob/0f8620b6678fd24c35f172721ea6046ab5780890/models/imagebind_model.py#L432
>
> If we look at the cosine similarity for Vision...
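To make the effect of the missing scaling concrete, here is a minimal pure-Python sketch. It is not ImageBind code: `similarity_probs`, the toy vectors, and the `logit_scale=20.0` value are all hypothetical and chosen only for illustration. It assumes L2-normalized embeddings and a softmax over cosine similarities, with the scale factor playing the role of an inverse temperature.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def similarity_probs(query, candidates, logit_scale=1.0):
    """Softmax over cosine similarities, optionally multiplied by a scale.

    logit_scale=1.0 corresponds to leaving the logits unscaled;
    a larger value (hypothetical here) sharpens the distribution.
    """
    q = l2_normalize(query)
    sims = [sum(a * b for a, b in zip(q, l2_normalize(c))) for c in candidates]
    return softmax([logit_scale * s for s in sims])


# Toy embeddings: the first candidate matches the query, the second is orthogonal.
query = [1.0, 0.0]
candidates = [[1.0, 0.0], [0.0, 1.0]]

unscaled = similarity_probs(query, candidates)                  # logits in [-1, 1]
scaled = similarity_probs(query, candidates, logit_scale=20.0)  # illustrative scale
```

Cosine similarities lie in [-1, 1], so without a scale factor the softmax stays fairly flat even for a perfect match (about 0.73 vs 0.27 in this toy case), whereas the scaled version puts nearly all of the mass on the matching candidate.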