DeCLIP icon indicating copy to clipboard operation
DeCLIP copied to clipboard

Filter YFCC data

Open Hxyou opened this issue 3 years ago • 3 comments

Hi, thanks for the great work. After downloading the provided YFCC15M label file, I can see there are three keys caption filename url in each one of the labels. how should we find the corresponding YFCC image according to your label? i.e., which key should we use to align with YFCC data?

Hxyou avatar May 25 '22 08:05 Hxyou

You can use the url as key , and filename for check

SlotherCui avatar Jun 23 '22 10:06 SlotherCui

The image name of YFCC data seems to be a md5 encoding. I'm also a little confused about how to make a connection.

raytrun avatar Jun 23 '22 10:06 raytrun

I am also trying to filter YFCC and I have the same issue. The dataset I have downloaded has a very different structure, and I don't know how to find the images based on the filename that you provide. Also I am not sure about what you mean by "Prepare the YFCC15M subset metadata pickle by the label".

My version of YFCC100M looks exactly the same as the one they have in the SLIP repo. Do you organise the data in a different way?

DonkeyShot21 avatar Sep 08 '22 16:09 DonkeyShot21