Dataset: CAMEL Math and Physics

Open Miserlou opened this issue 2 years ago • 4 comments

50K+ math, physics, and code input/output pairs. Sounds tasty, but the data is hidden in a zip file.

https://github.com/lightaime/camel#data-hosted-on-hugging-face

Miserlou • Apr 13 '23 15:04

via https://twitter.com/hammh0a/status/1646524135538065409

Miserlou • Apr 13 '23 15:04

Looks like a really good dataset. I can probably convert it to OA format.

CheckMC • Apr 15 '23 02:04

More sciences have been added, apparently in a more accessible format:

https://huggingface.co/datasets/camel-ai/chemistry
https://huggingface.co/datasets/camel-ai/biology

via https://github.com/lightaime/camel#data-hosted-on-hugging-face

Miserlou • Apr 16 '23 19:04

On a second look, we might have to reconsider these datasets: the license says they are for research use only, not commercial use, and some of the data was generated with GPT-4.

Edit: all good, HF shows an okay license.

CheckMC • Apr 16 '23 21:04

Looks like this is completed. I'm going to close it out.

camsdixon1 • Jun 24 '23 13:06