
Collect data from Data Catalog

Open albertvillanova opened this issue 4 years ago • 3 comments

Datasets Hackathon in the BigScience Data Tooling working group.

How to contribute

  • Step-by-step guide: https://github.com/bigscience-workshop/data_tooling/wiki/datasets-hackathon
  • List of catalogued datasets to be collected: https://github.com/orgs/bigscience-workshop/projects/2/views/7

albertvillanova, Nov 23 '21 06:11

Comment by @HugoLaurencon: https://github.com/bigscience-workshop/data_tooling/issues/246#issuecomment-977190632

Is it possible to create a directory with a txt file for every issue that you posted (and that you potentially update)? Or to find another solution? It really becomes unmanageable if you open hundreds of issues.

albertvillanova, Nov 23 '21 21:11

@HugoLaurencon I have created an issue for every entry in the BigScience Data Catalog (created by the Data Source WG): http://23.251.145.180:8501/

We are preparing a hackathon to contribute all data sources.

albertvillanova, Nov 23 '21 21:11

Thanks!

HugoLaurencon, Nov 23 '21 22:11