Cannot get NYT dataset
I try the links you provide that "NYT(The New York Times Annotated Corpus) can only be available from LDC. And we follow the preprocessing code of Durrett et al. (2016) to get the NYT50 datasets". But they all cannot be used due to the license issue. Could you provide the data (original data and processed code) to us through email? [email protected]
Thanks a lot.
Hello~ I got the same problem in getting "the preprocessing code of Durrett al. (2016)", it seems that the resource has been canceled. Could you pls send me a copy of the code through email? [email protected] Thanks a lot!
Hello, I have the same problem. Could you please provide the data (original data: The New York Times Annotated Corpus)) to me through email: [email protected]?
Thanks a lot!
Maybe that could help. The greedy selection in https://github.com/nlpyang/PreSumm/blob/master/src/prepro/data_builder.py
Sorry, due to the license, we cannot share the dataset directly. Please apply for it based on the link provided.