easy-clustering

Have you ever looked at your data and wondered how many clusters you should even pass to K-means? I got tired of hunting for the elbow in an elbow plot, so I wrote these functions. They perform agglomerative clustering on X, automatically choose a distance cutoff for defining clusters, and plot the clusters on both a dendrogram and a UMAP so you can inspect their quality.
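To make the idea of an automatic distance cutoff concrete, here is a minimal sketch. It is not the code in cluster.py: the function name `auto_cutoff_clusters` is hypothetical, and the "largest gap between merge heights" rule is just one common heuristic, assumed here for illustration. It relies only on SciPy's `linkage` and `fcluster`.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def auto_cutoff_clusters(X, method="ward"):
    """Hypothetical helper: agglomerative clustering with a gap-based cutoff.

    Assumption: the cutoff is placed in the middle of the largest jump
    between consecutive merge heights. X needs at least 3 rows so that
    there are at least two merges to compare.
    """
    Z = linkage(X, method=method)        # (n - 1, 4) table of merges
    heights = Z[:, 2]                    # merge distances, non-decreasing for ward
    gaps = np.diff(heights)              # jump from each merge to the next
    i = int(np.argmax(gaps))             # largest jump sits between merge i and i + 1
    cutoff = (heights[i] + heights[i + 1]) / 2.0
    labels = fcluster(Z, t=cutoff, criterion="distance")
    return labels, cutoff, Z


# Toy data: three well-separated blobs of 30 points each.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=(0, 0), scale=0.3, size=(30, 2)),
    rng.normal(loc=(5, 5), scale=0.3, size=(30, 2)),
    rng.normal(loc=(0, 5), scale=0.3, size=(30, 2)),
])
labels, cutoff, Z = auto_cutoff_clusters(X)
print(len(set(labels.tolist())), "clusters at cutoff", round(float(cutoff), 2))
```

The returned `Z` can be passed to `scipy.cluster.hierarchy.dendrogram` with `color_threshold=cutoff`, so that branches below the cutoff are colored by cluster. The gap rule can misfire on data without clear separation, which is why inspecting the dendrogram is still worthwhile.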

Additionally, the plot_umap function accepts up to 4 y variables and plots each one on its own subplot, so you can see how important features of your data coincide with the clusters. The colorbars adapt automatically to make binary and continuous features easier to read.
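As a rough illustration of the multi-panel idea, the sketch below colors a 2-D embedding by each y variable and switches the colormap for binary features. It is not the actual plot_umap implementation: `plot_embedding` and `is_binary` are hypothetical names, the colormap choices are assumptions, and the embedding is taken as already computed (for example with umap-learn's `UMAP().fit_transform(X)`); random points stand in for it here.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless backend; drop this line for interactive use
import matplotlib.pyplot as plt


def is_binary(y):
    """True when y takes at most two distinct values."""
    return np.unique(np.asarray(y)).size <= 2


def plot_embedding(embedding, ys, names=None):
    """Hypothetical helper: one scatter panel per y variable (1 to 4)."""
    if not 1 <= len(ys) <= 4:
        raise ValueError("expected between 1 and 4 y variables")
    names = names or [f"y{i + 1}" for i in range(len(ys))]
    fig, axes = plt.subplots(1, len(ys), figsize=(4 * len(ys), 4), squeeze=False)
    for ax, y, name in zip(axes[0], ys, names):
        y = np.asarray(y, dtype=float)
        binary = is_binary(y)
        # Assumed choice: diverging map for binary, sequential for continuous.
        cmap = "coolwarm" if binary else "viridis"
        sc = ax.scatter(embedding[:, 0], embedding[:, 1], c=y, cmap=cmap, s=10)
        cbar = fig.colorbar(sc, ax=ax)
        if binary:
            cbar.set_ticks(np.unique(y))  # tick only the observed values
        ax.set_title(name)
    return fig


# Stand-in for a UMAP embedding: 200 random 2-D points.
rng = np.random.default_rng(0)
emb = rng.normal(size=(200, 2))
y_binary = (emb[:, 0] > 0).astype(int)   # binary feature
y_cont = emb[:, 1]                       # continuous feature
fig = plot_embedding(emb, [y_binary, y_cont], names=["treated", "score"])
```

Calling `fig.savefig("panels.png")` writes the figure to disk. Cluster labels can be passed as one of the y variables to compare them against the other features side by side.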

