pix-plot icon indicating copy to clipboard operation
pix-plot copied to clipboard

Adjust UMAP hyperparameters given user data

Open duhaime opened this issue 7 years ago • 1 comments

when running UMAP with very small or very large datasets, the default hyperparameters produce results that are too far apart or too tightly clustered. We can help fix this problem by setting UMAP's hyperparameters using insights from a user's dataset.

duhaime avatar Apr 27 '18 22:04 duhaime

From testing it appears the spread variable could be key to visualizing small (1,000) datasets. Defaulting to 1, I've found that 0.4 works well with a dataset of 1,600 images. And conversely going above 1 might help with much larger datasets. It's described as "The effective scale of embedded points"; in pixplot it sort of has the effect of adjusting white space.

pleonard212 avatar Jun 08 '18 19:06 pleonard212