Proposal: Use scientific papers for data.
Proposal: What about using open access publications from arXiv to generate a conversational AI system that can provide answers to questions related to scientific papers. As arXiv encourages choosing a liberal license for re-use of the papers (https://info.arxiv.org/help/license/index.html), I think it would be a valuable resource.
Implementation: We can use a question-and-answer format that can extract information from the scientific papers. We could think like a scientific person who needs to write a paper (this is also value for essay data). Here is an example of how the conversational AI system can be used:
Abstract generation: Question: Could you write an abstract for this title
Title suggestion:
Question: What could be a great title for this abstract:
Section content suggestion: Question: I am writing a paper about this
And more: Summary generation, Orthography correction (with a manipulation of the text), Citation recommendation...