Andrew W
Andrew W
Update: get_representative_docs() seems to be drawing from the original labels, rather than the merged labels.
Thanks Maarten! Sorry for the slight mess. I copied this out of the notebook that I've been troubleshooting in. ` # Calculate Embeddings sentence_model = SentenceTransformer("all-MiniLM-L6-v2") embeddings = sentence_model.encode(docs, show_progress_bar=True)...
One thing that I've noticed is that it tends to get more and more messed up with each iteration, if I go through multiple rounds of merges.