biomedical icon indicating copy to clipboard operation
biomedical copied to clipboard

ntcir_13_medweb text schema needs investigation

Open galtay opened this issue 3 years ago • 0 comments

https://github.com/bigscience-workshop/biomedical/blob/master/bigbio/biodatasets/ntcir_13_medweb/ntcir_13_medweb.py

this is a text classification on tweets task for config = ntcir_13_medweb_classification_en_bigbio_text it appears that all tweets with a given label are in one sample this issue is to investigate this and probably implement a one tweet per sample form. the other configs for this dataset should also be checked.

galtay avatar Jun 04 '22 20:06 galtay