Collect data from Data Catalog
Datasets Hackathon in the BigScience Data Tooling working group.
How to contribute
- Step-by-step guide: https://github.com/bigscience-workshop/data_tooling/wiki/datasets-hackathon
- List of catalogued datasets to be collected: https://github.com/orgs/bigscience-workshop/projects/2/views/7
Comment by @HugoLaurencon: https://github.com/bigscience-workshop/data_tooling/issues/246#issuecomment-977190632
Would it be possible to create a directory with one txt file for each issue you posted (and may later update), or to find another solution? It really becomes unmanageable if you open hundreds of issues.
@HugoLaurencon I have opened one issue for every entry in the BigScience Data Catalog (created by the Data Source WG): http://23.251.145.180:8501/
We are preparing a hackathon so that contributors can collect all of these data sources.
Thanks!