Research
Research copied to clipboard
Can dataset processed in MMPMS be Shared?
MMPMS("Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection") evaluate the proposed model on two public conversation dataset: Weibo [Shang et al., 2015] and Reddit [Zhou et al., 2018] that maintain a large repository of post-response pairs from popular social websites. The paper mentioned that "After basic data cleaning, we have above 2 million pairs in both datasets." I would like to train the MMPMS from scratch. Could you please share the cleaned data?