HeterSumGraph icon indicating copy to clipboard operation
HeterSumGraph copied to clipboard

Cannot get NYT dataset

Open yeliu918 opened this issue 5 years ago • 3 comments

I try the links you provide that "NYT(The New York Times Annotated Corpus) can only be available from LDC. And we follow the preprocessing code of Durrett et al. (2016) to get the NYT50 datasets". But they all cannot be used due to the license issue. Could you provide the data (original data and processed code) to us through email? [email protected]

Thanks a lot.

yeliu918 avatar Dec 07 '20 04:12 yeliu918

Hello~ I got the same problem in getting "the preprocessing code of Durrett al. (2016)", it seems that the resource has been canceled. Could you pls send me a copy of the code through email? [email protected] Thanks a lot!

FortuneSeeker avatar Apr 07 '21 05:04 FortuneSeeker

Hello, I have the same problem. Could you please provide the data (original data: The New York Times Annotated Corpus)) to me through email: [email protected]?

Thanks a lot!

physics39 avatar Sep 27 '21 07:09 physics39

Maybe that could help. The greedy selection in https://github.com/nlpyang/PreSumm/blob/master/src/prepro/data_builder.py

YuxiangZhang0114 avatar Nov 26 '21 04:11 YuxiangZhang0114

Sorry, due to the license, we cannot share the dataset directly. Please apply for it based on the link provided.

dqwang122 avatar Mar 13 '23 06:03 dqwang122