node-language-detection
node-language-detection copied to clipboard
Source of language satasets
Where is the source text dataset for the Ngrams of those 53 languages? Would like to see if it is different from https://github.com/wooorm/franc/issues/78 usage of UDHR, and if it is more accurate than them.