Source of language datasets

Open DonaldTsang opened this issue 6 years ago • 0 comments

Where is the source text dataset for the n-grams of those 53 languages? I would like to see whether it differs from the UDHR text used in https://github.com/wooorm/franc/issues/78, and whether it gives more accurate results.

Nov 21 '19 07:11 DonaldTsang