biomedical
ntcir_13_medweb text schema needs investigation
https://github.com/bigscience-workshop/biomedical/blob/master/bigbio/biodatasets/ntcir_13_medweb/ntcir_13_medweb.py
This is a text classification task on tweets.
For config = ntcir_13_medweb_classification_en_bigbio_text, it appears that all tweets with a given label are combined into a single sample.
This issue is to investigate that and, if it is confirmed, most likely implement a one-tweet-per-sample form.
The other configs for this dataset should also be checked.
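A minimal sketch of what a one-tweet-per-sample form could look like, not the loader's actual code. It assumes the raw data arrives as one row per tweet, with hypothetical column names "ID" and "Tweet" plus one column per label holding "p" (positive) or "n" (negative), and that the text schema uses the fields id, document_id, text, and labels. The function name rows_to_text_samples is made up for illustration.

```python
from typing import Dict, Iterator, List


def rows_to_text_samples(
    rows: List[Dict[str, str]], label_columns: List[str]
) -> Iterator[Dict[str, object]]:
    """Yield one text-schema sample per tweet row.

    Assumed (hypothetical) row layout: "ID", "Tweet", and one key per label
    whose value is "p" when the label applies and "n" otherwise.
    """
    for uid, row in enumerate(rows):
        # Collect only the labels marked positive for this single tweet.
        labels = [name for name in label_columns if row[name] == "p"]
        yield {
            "id": str(uid),
            "document_id": row["ID"],
            "text": row["Tweet"],
            "labels": labels,
        }


# Illustrative rows, not taken from the dataset.
rows = [
    {"ID": "1en", "Tweet": "I have a fever.", "Fever": "p", "Cough": "n"},
    {"ID": "2en", "Tweet": "Feeling fine today.", "Fever": "n", "Cough": "n"},
]
samples = list(rows_to_text_samples(rows, ["Fever", "Cough"]))
```

With this shape, the number of samples equals the number of tweets, and a tweet with no positive label still gets its own sample with an empty labels list, which makes the grouping problem easy to spot by comparing counts.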