TopicTuner icon indicating copy to clipboard operation
TopicTuner copied to clipboard

HDBSCAN Tuning for BERTopic Models

Results 6 TopicTuner issues
Sort by recently updated
recently updated
newest added

For the same docs, my dimensionality reduction in Bertopic costed 1.5 hour but tmt.reduce() only costed 10 more mins. The following is the output of tmt.reduce(): UMAP(angular rp forest=True, metric='cosine,...

Hi Great package! May I ask what evaluation metrics you used for evaluate the success?

@drob-xx I checked your code, very impressive work, here I got a question. I think you used grid search to do different setting of min_cluster_size and min-samples and did some...

Hi there, It would be great if visualizeEmbeddings used grey or something like that for the -1 outlier topic so that it is distinguishable from other topics (like the visualisation...

Hi there, Small thing, I think it may be helpful to have some error checking on whether docs have been set when they are needed. I was running through the...

Hey, First off, great work for the library, i think it already helps in general for most use cases to cluster with the least amount of outliers as possible, and...