textstat
Language support clarification
In the README, a number of languages are listed as being supported.
Is this for all functions?
Languages supported:
US English
Catalan
Czech
Danish
Spanish
Estonian
Finnish
French
Hungarian
Indonesian
Icelandic
Italian
Latin
Dutch (Nederlands)
Bokmål (Norwegian)
Polish
Portuguese
Russian
Swedish
@scarroll32 Hey, good point. Methods like difficult_words and forcast accept an optional language argument, which defaults to 'en_us'. I should add language support to the other methods as well, because at the moment most of them support only English. In general, textstat uses the text-hyphen library to syllabify words and the easy-word dictionaries (lib/dictionaries/ folder), so supporting these languages should not be a problem. At this point, there are tests only for English.
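To make the optional language argument concrete, here is a minimal, self-contained sketch of the pattern described above. It is not textstat's actual code: the EASY_WORDS table, rough_syllable_count, and difficult_words_sketch are hypothetical names, the tiny word sets stand in for the files under lib/dictionaries/, and the vowel-group counter only stands in for text-hyphen. The "at least two syllables" rule is also an assumption.

```ruby
require 'set'

# Hypothetical stand-in for the easy-word dictionaries in lib/dictionaries/.
# The real dictionaries are far larger; these sets are for illustration only.
EASY_WORDS = {
  'en_us' => Set.new(%w[the cat sat on a mat]),
  'es'    => Set.new(%w[el gato en la casa])
}.freeze

# Naive syllable estimate that counts vowel groups. This is NOT how
# text-hyphen works; it is only a placeholder for a real syllabifier.
def rough_syllable_count(word)
  groups = word.downcase.scan(/[aeiouyáéíóúü]+/).length
  groups.zero? ? 1 : groups
end

# Counts unique words that are missing from the language's easy-word set
# and have at least two estimated syllables (assumed rule).
# The language argument is optional and defaults to 'en_us'.
def difficult_words_sketch(text, language = 'en_us')
  easy = EASY_WORDS.fetch(language) do
    raise ArgumentError, "no dictionary for language: #{language}"
  end
  words = text.downcase.scan(/\p{L}+/).uniq
  words.count { |w| !easy.include?(w) && rough_syllable_count(w) >= 2 }
end

puts difficult_words_sketch('The cat sat on a complicated mat')
puts difficult_words_sketch('El gato en la biblioteca', 'es')
```

In this sketch, a language without a dictionary raises an error instead of silently falling back to English. That is one possible design choice, not necessarily what the gem does.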