Proposal to add MedMCQA dataset

Open giyaseddin opened this issue 3 years ago • 3 comments

Adding a Dataset

  • Name: MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering
  • Description: A large-scale (194k) Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.
  • Task: QnA
  • Paper: https://arxiv.org/abs/2203.14371
  • Data: https://github.com/medmcqa/medmcqa
  • License: Apache-2.0 License
  • Motivation: It is a very recent large-scale medical QA dataset that covers ~20 different topics of entrance exam questions

giyaseddin avatar Apr 07 '22 23:04 giyaseddin

#self-assign

giyaseddin avatar Apr 07 '22 23:04 giyaseddin

Hi @giyaseddin, just a ping on the status of this dataset. Please let us know if you are still working on it and when you plan to submit a PR. Thanks!!

jason-fries avatar Apr 19 '22 22:04 jason-fries

Thanks for the reminder @jason-fries. I'm afraid I'm not able to continue this week, but I'd like to open a PR in the near future if it remains free.

giyaseddin avatar Apr 20 '22 07:04 giyaseddin