EmbodiedScan icon indicating copy to clipboard operation
EmbodiedScan copied to clipboard

Introduce MMScan dataset on Hugging Face

Open NielsRogge opened this issue 7 months ago • 1 comments

Hi @mxh1999 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers as yours got featured: https://huggingface.co/papers/2406.09401. The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF, add Github and project page URLs.

It'd be great to make the dataset available on the 🤗 hub, to improve its discoverability/visibility.

Would be awesome to host the dataset, so that people can do:

from datasets import load_dataset

dataset = load_dataset("your-hf-org-or-username/your-dataset")

See here for a guide: https://huggingface.co/docs/datasets/loading.

Besides that, there's the dataset viewer which allows people to quickly explore the first few rows of the data in the browser.

Let me know if you're interested/need any help regarding this!

Cheers,

Niels

NielsRogge avatar Jun 10 '25 18:06 NielsRogge

Hi Niels,

We're really honored that you found our work interesting.

We really appreciate your suggestion about hosting the dataset on the Hugging Face Hub. We'll have our team member follow up soon.

Cheers,

Xiaohan

mxh1999 avatar Jun 10 '25 19:06 mxh1999