Open-Assistant add poetry dataset setup

Dataset Description

This dataset contains around 14,000 poems from the PoetryFoundation.org site. The poems are converted to question:response pairs, using each poem's tags as topics. 5% of the dataset consists of titling requests, in which the user provides a poem and asks the assistant to title it.

Languages

English

Dataset Structure

This dataset follows the OA format, which is:

INSTRUCTION (string): The user asks for a poem (from a variety of premade prompts) with topics (tags). If the given poem has no tags, the user asks for a poem on its own.

RESPONSE (string): The assistant replies with the poem and title (from a variety of premade prompts).

SOURCE (string): The source is PoetryFoundation.org and the poet's name.

METADATA (JSON String): {"author": "author of the original poem", "title": "title of the poem", "tags": "tags from poetry foundation."}
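For reviewers who want to see what one row looks like, here is a minimal sketch of building a row with the four columns above. It is not the code in prepare.py: the prompt templates, the `make_row` helper, the exact SOURCE string layout, and the choice to store tags as one comma-separated string are all assumptions made for illustration.

```python
import json

# Hypothetical prompt templates; the premade prompts in prepare.py may differ.
REQUEST_WITH_TAGS = "Write me a poem about {topics}."
REQUEST_NO_TAGS = "Write me a poem."
REPLY = 'Here is a poem titled "{title}":\n\n{poem}'
TITLE_REQUEST = "Suggest a title for this poem:\n\n{poem}"
TITLE_REPLY = 'You could call it "{title}".'


def make_row(poem, title, author, tags, titling=False):
    """Build one row with the INSTRUCTION, RESPONSE, SOURCE and METADATA columns."""
    if titling:
        # Titling request: the user supplies the poem and asks for a title.
        instruction = TITLE_REQUEST.format(poem=poem)
        response = TITLE_REPLY.format(title=title)
    else:
        if tags:
            # Tags become the topics of the request.
            instruction = REQUEST_WITH_TAGS.format(topics=", ".join(tags))
        else:
            # No tags: the user asks for a poem on its own.
            instruction = REQUEST_NO_TAGS
        response = REPLY.format(title=title, poem=poem)

    # Assumption: tags are stored as a single comma-separated string.
    metadata = {"author": author, "title": title, "tags": ", ".join(tags)}
    return {
        "INSTRUCTION": instruction,
        "RESPONSE": response,
        # Assumption: SOURCE joins the site name and the poet's name like this.
        "SOURCE": f"PoetryFoundation.org - {author}",
        # METADATA is a JSON string, not a nested object.
        "METADATA": json.dumps(metadata),
    }


row = make_row("line one\nline two", "Example", "A. Poet", ["Love", "Nature"])
print(row["INSTRUCTION"])
print(row["METADATA"])
```

Since METADATA is stored as a string, a consumer would call `json.loads(row["METADATA"])` to get the author, title, and tags back as a dictionary.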

Preparing the Dataset

The dataset can be created with prepare.py. Make sure to install the required libraries in requirements.txt!

Contributions

Created by Check. Original dataset source: https://www.kaggle.com/datasets/tgdivy/poetry-foundation-poems

You can view it on my Hugging Face page here: https://huggingface.co/datasets/checkai/instruction-poems (this time I ran pre-commit, so it should be good :D)

Apr 19 '23 00:04 CheckMC

:x: pre-commit failed. Please run pre-commit run --all-files locally and commit the changes. Find more information in the repository's CONTRIBUTING.md

Apr 19 '23 00:04 github-actions[bot]

:x: pre-commit failed. Please run pre-commit run --all-files locally and commit the changes. Find more information in the repository's CONTRIBUTING.md

Apr 19 '23 02:04 github-actions[bot]

:x: pre-commit failed. Please run pre-commit run --all-files locally and commit the changes. Find more information in the repository's CONTRIBUTING.md

Apr 19 '23 14:04 github-actions[bot]

:x: pre-commit failed. Please run pre-commit run --all-files locally and commit the changes. Find more information in the repository's CONTRIBUTING.md

Apr 19 '23 22:04 github-actions[bot]