Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
Results: 2
Vision_by_Language issues
I want to replace the gpt-3.5-turbo model in the code with a Llama-series model. What do I need to do?
I tested OpenAI's CLIP ViT-B/32 on the FashionIQ validation set, and my results differ substantially from those reported in the paper. Are there any tricks needed to reproduce them?...