Lucas

Results 4 issues of Lucas

https://huggingface.co/datasets/Anthropic/hh-rlhf Is this a dataset we could use for additional training? Would we need to make format changes?

data

Had an idea for a poetry dataset: https://www.kaggle.com/datasets/tgdivy/poetry-foundation-poems This is a dataset with around 14,000 poems, and 93% of them have tags which describe their topic. We could have some...

data

Added the oa_camel folder in data. This dataset is quite large, at 110,000 entries. The original source is found on their page on HF: https://huggingface.co/camel-ai It contains question-answer pairs about...

Dataset Description This dataset contains around 14,000 poems from the PoetryFoundation.org site. They are converted to question:response pairs, using the tags as topics. 5% of the dataset is titling requests...