
Could you please provide your code for downloading CC3M+CC12M+SBU data from the json file you provided?

Open asgsaeid opened this issue 3 years ago • 4 comments

asgsaeid avatar Dec 01 '22 20:12 asgsaeid

The JSON files contain the image URL and text for each sample. You can write a script to download the images from those URLs. This tool could be helpful: https://github.com/rom1504/img2dataset.

LiJunnan1992 avatar Dec 02 '22 02:12 LiJunnan1992
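
For readers looking for a starting point, the download script suggested above might look roughly like the sketch below. It is not the BLIP authors' code. It assumes each JSON entry is a dict with `url` and `caption` keys, and the hashed file naming and the `image` key in the returned annotations are hypothetical choices. For the full CC3M+CC12M+SBU set, img2dataset is likely the more practical option, since this sketch downloads sequentially.

```python
import hashlib
import os
import urllib.request


def local_name(url):
    """Derive a stable file name from the URL (hypothetical naming scheme)."""
    return hashlib.sha1(url.encode("utf-8")).hexdigest() + ".jpg"


def fetch_bytes(url, timeout=10):
    """Download the raw bytes behind a URL."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_all(entries, out_dir, fetch=fetch_bytes):
    """Save each image under out_dir and return annotations with local paths.

    `entries` is assumed to be a list of {"url": ..., "caption": ...} dicts.
    """
    os.makedirs(out_dir, exist_ok=True)
    kept = []
    for entry in entries:
        path = os.path.join(out_dir, local_name(entry["url"]))
        if not os.path.exists(path):  # skip files already downloaded
            try:
                data = fetch(entry["url"])
            except Exception as exc:  # dead links are common in web-scale sets
                print(f"skip {entry['url']}: {exc}")
                continue
            with open(path, "wb") as f:
                f.write(data)
        kept.append({"image": path, "caption": entry["caption"]})
    return kept
```

A typical call would load the annotation file with `json.load` and pass the resulting list to `download_all(entries, "images")`. Entries whose download fails are dropped from the returned list.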

I find that the code reads image files directly from their paths instead of from .tar or parquet files. However, img2dataset says that "handling more than a million files in standard filesystem does not work well", so it suggests using the webdataset format. Do I have to untar the files to support your data reading strategy?

Richar-Du avatar Mar 08 '23 02:03 Richar-Du

Yes, you need raw image paths to work with our annotation file. Alternatively, you can re-create the tars and use webdataset.

LiJunnan1992 avatar Mar 08 '23 03:03 LiJunnan1992
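
If the images were already downloaded as tar shards, the untar step discussed above could be sketched as follows. This uses only Python's standard `tarfile` module. The shard layout (image files stored next to caption and metadata files in each tar) and the extension list are assumptions, and the function name is hypothetical.

```python
import os
import tarfile


def extract_images(tar_path, out_dir, exts=(".jpg", ".jpeg", ".png", ".webp")):
    """Copy only the image members of one tar shard into out_dir.

    Returns the list of written file paths.
    """
    os.makedirs(out_dir, exist_ok=True)
    written = []
    with tarfile.open(tar_path) as tar:
        for member in tar:
            # Ignore directories, captions, and metadata files.
            if not member.isfile() or not member.name.lower().endswith(exts):
                continue
            # Use the base name only, so a member cannot escape out_dir.
            target = os.path.join(out_dir, os.path.basename(member.name))
            with tar.extractfile(member) as src, open(target, "wb") as dst:
                dst.write(src.read())
            written.append(target)
    return written
```

Extracting each shard into its own subdirectory, rather than one flat directory, may ease the filesystem concern quoted above. The annotation file would then need to point at the resulting paths.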

Got it! Thanks for your reply :)

Richar-Du avatar Mar 08 '23 06:03 Richar-Du