Document Clustering between KeyBERT and Sentence Transformer?

Open keyuchen21 opened this issue 2 years ago • 3 comments

Has anyone compared using KeyBERT versus Sentence Transformers for document clustering, and what differences did you see?

keyuchen21 • Feb 27 '23 17:02

KeyBERT itself already uses SentenceTransformers to extract the document and word embeddings. It might be interesting to compare how well clustering works on the keyword embeddings versus the document embeddings, but unfortunately I have not tried it out yet.
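
For anyone who wants to try this, a minimal sketch of such a comparison might look like the following. It is untested against real data and rests on several assumptions that are not from this thread: the model name `all-MiniLM-L6-v2`, mean-pooling each document's keyword embeddings into one vector, KMeans from scikit-learn as the clustering algorithm, and the adjusted Rand index as the agreement measure. The helper names (`adjusted_rand_index`, `cluster`, `compare_clusterings`) are hypothetical.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Agreement between two clusterings (1.0 means identical partitions)."""
    n = len(labels_a)
    if n < 2:
        return 1.0
    # Count pairs of points that share a cluster in both, in A only, in B only.
    sum_ab = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        return 1.0
    return (sum_ab - expected) / (max_index - expected)


def cluster(embeddings, n_clusters):
    """Assumption: KMeans is used here; any clustering algorithm would do."""
    from sklearn.cluster import KMeans

    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=0)
    return model.fit_predict(embeddings).tolist()


def compare_clusterings(docs, n_clusters, model_name="all-MiniLM-L6-v2", top_n=5):
    """Cluster docs by document embedding and by pooled keyword embedding."""
    import numpy as np
    from keybert import KeyBERT
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)
    kw_model = KeyBERT(model=model)  # reuse the same embedding model

    doc_embeddings = model.encode(docs)

    keyword_embeddings = []
    for doc, doc_embedding in zip(docs, doc_embeddings):
        keywords = [kw for kw, _ in kw_model.extract_keywords(doc, top_n=top_n)]
        if keywords:
            # Assumption: represent a document by the mean of its keyword embeddings.
            keyword_embeddings.append(model.encode(keywords).mean(axis=0))
        else:
            # No keywords found: fall back to the document embedding.
            keyword_embeddings.append(doc_embedding)
    keyword_embeddings = np.vstack(keyword_embeddings)

    doc_labels = cluster(doc_embeddings, n_clusters)
    keyword_labels = cluster(keyword_embeddings, n_clusters)
    return adjusted_rand_index(doc_labels, keyword_labels)
```

Calling `compare_clusterings(docs, n_clusters=5)` on a list of document strings would return a score near 1.0 if both representations group the documents the same way. Note that this only measures agreement between the two clusterings, not which one is better; judging quality would need ground-truth labels or manual inspection.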

MaartenGr • Feb 28 '23 05:02

@MaartenGr Yeah! I read your official docs a few months ago, and I remember there was a section in which you suggested first using KeyBERT and then clustering. I recently tried to find that section again but was not able to locate it.

keyuchen21 • Mar 01 '23 03:03

I actually do not remember writing about such a use case in the KeyBERT documentation. It may have been PolyFuzz, but KeyBERT is not generally used for clustering unless the word embeddings are clustered again.

MaartenGr • Mar 01 '23 05:03