textstat
textstat copied to clipboard
Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
I wanted to use the list of difficult words rather than just seeing the count so I've added an optional argument for that. I added an rspec test for that....
I was using this gem to calculate Flesch reading scores, and noticed some discrepancies between the scores returned here versus other online tools ([like this one](https://charactercalculator.com/flesch-reading-ease/)). In investigating, it appears...
It would be nice to introduce a new calculation feature for [CEFR ](https://en.wikipedia.org/wiki/Common_European_Framework_of_Reference_for_Languages)levels.
Since languages other than English are supported, it makes sense to write tests for algorithms that use other languages.
In the README, a number of languages are listed as being supported. Is this for all functions? ``` Languages supported: US English Catalan Czech Danish Spanish Estonian Finnish French Hungarian...
See: https://readabilityformulas.com/fry-graph-readability-formula.php
See: https://readabilityformulas.com/the-bormuth-readability-formula.php
I've noticed some pretty nasty performance issues related to the use of the Text::Hyphen package. When passed in really long words it can take 5+ minutes to 'visualise'. Given there...
Add no1.txt to lib/dictionaries/ with ~2000-2500 easiest Norsk (Norwegian) words
Add mn.txt to lib/dictionaries/ with ~2000-2500 easiest Mongolian words