
WMT'16 En-De Dataset Download Link is broken

Open jiachenzhu opened this issue 2 years ago • 3 comments

The dataset link used in https://github.com/facebookresearch/fairseq/blob/main/examples/scaling_nmt/README.md is broken. Is there anywhere else I can find the dataset?

jiachenzhu, Jan 17 '24 04:01

#5423 Updated dataset link: https://www.statmt.org/wmt16/index.html

#5423 Updated dataset link: https://www.statmt.org/wmt16/index.html

Excuse me, the preprocessed dataset link in "Training a new model on WMT'16 En-De" is still broken. Alternatively, could you tell me how to preprocess the dataset downloaded from the link you provided? Thank you very much.

Armilius, Jan 25 '24 15:01

The dataset link used in https://github.com/facebookresearch/fairseq/blob/main/examples/scaling_nmt/README.md is broken. Is there any place I can find the dataset?

Have you found another way to download the preprocessed data?

Armilius, Jan 25 '24 15:01