XFUND icon indicating copy to clipboard operation
XFUND copied to clipboard

format of zh and ja

Open bakhbyergyen opened this issue 3 years ago • 0 comments

hi, I wanted to know that, why zh and ja datasets are split by character? not word by word? when building a dataset, sentences can be split by words, not characters? thank you. image

bakhbyergyen avatar Apr 01 '22 01:04 bakhbyergyen