Release all datasets from MultiMed on Hugging Face
Hi @leduckhai 🤗
I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through Hugging Face's daily papers, where it was featured: https://huggingface.co/papers/2409.14074. The paper page lets people discuss your paper and find artifacts related to it (your dataset, for instance). You can also claim the paper as yours, which will show up on your public profile at HF, and add GitHub and project page URLs.
It's great to see the MultiMed dataset and model available on the Hub. Would you also like to make the VietMed, VietMed-NER, and VietMed-Sum datasets available on 🤗? Hosting on Hugging Face will give you more visibility and make the datasets easier to discover.
Would be awesome to make the dataset available on 🤗 , so that people can do:
from datasets import load_dataset
dataset = load_dataset("your-hf-org-or-username/your-dataset")
See here for a guide: https://huggingface.co/docs/datasets/loading. We also support WebDataset, which is useful for image/video datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.
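For the upload side, here is a minimal sketch of how a dataset could be pushed to the Hub with the `datasets` library. The column names, the example rows, and the repo id are placeholders (assumptions for illustration, not the actual VietMed schema), and it assumes you have already authenticated, e.g. with `huggingface-cli login`.

```python
def build_example_records():
    """Return a tiny column-oriented table.

    The columns ("utterance_id", "text") are hypothetical and only
    illustrate the dict-of-lists shape that Dataset.from_dict accepts.
    """
    return {
        "utterance_id": ["utt-0001", "utt-0002"],
        "text": ["first example transcript", "second example transcript"],
    }


def publish(repo_id, records):
    """Build a Dataset from in-memory records and upload it to the Hub.

    Requires the `datasets` package and a logged-in Hugging Face account.
    The import is kept local so the rest of the sketch runs without it.
    """
    from datasets import Dataset

    dataset = Dataset.from_dict(records)
    dataset.push_to_hub(repo_id)


records = build_example_records()

# Placeholder repo id; replace with your own org/username and dataset name:
# publish("your-hf-org-or-username/your-dataset", records)
```

Once the upload finishes, the `load_dataset(...)` call above should work with the same repo id.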
Besides that, there's the dataset viewer which allows people to quickly explore the first few rows of the data in the browser.
Let me know if you're interested/need any help regarding this!
Kind regards,
Niels
Hi @NielsRogge
Yes I would love to make all my datasets available on Hugging Face :)
Please also make my newest dataset available: https://huggingface.co/leduckhai/MultiMed-ST
Thanks a lot! <3
It looks like https://huggingface.co/papers/2504.03546 has a linked model and dataset repository, so I assume you have released everything on the hub?
yes you are right! 👍🏻
Great, could you link them in the README of this GitHub repo?
Yes I already linked them :)