ruby-tf-idf
ruby-tf-idf copied to clipboard
Ruby gem that calculates TF-IDF out of a text to find most relevant words in each document of the corpus
Hi there, I have an issue with the gem, when you give a corpus already downcased it fails because downcase! returns nil. Here is the line: https://github.com/mathieuripert/ruby-tf-idf/blob/master/lib/ruby-tf-idf.rb#L100 Maybe you can...
downcase! was causing that some documents that were already in downcase were lost, as downcase! returns nil if no change is made.
Thanks for building this gem. I'm using it for my project with Indonesian documents. I added Indonesian stop words to the listed stop words, and I tried to fix a...