Fernando Tarin Morales
Hello, I was working on creating an exciting dataset to fine-tune some of the newly available LLMs. Unfortunately, the prompts are too big to be used by the means I...
I'd like to add support for the [Catalan language](https://en.wikipedia.org/wiki/Catalan_language), similar to [Euskara](https://en.wikipedia.org/wiki/Basque_language), which is already supported by Open-Assistant. Catalan is a language spoken in some areas of Spain. So far...
I have just finished a dataset very similar to https://github.com/LAION-AI/Open-Assistant/tree/main/data/datasets/fd_dialogue but for Japanese, with the data taken from opensubtitles.org. The dataset contains subtitles for over 7,000 TV shows and movies....
… and updated __init__.py. This solves https://github.com/LAION-AI/Open-Assistant/issues/297