libsvmdata
libsvmdata copied to clipboard
ENH add fetcher for the criteo dataset
This requires a bit more code as the dataset is shipped as a tar file containing both the train and test set.
This PR makes it possible to download/unzip and preprocess both train and test, to avoid wasting time downloading/uncompressing it twice.