Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / berksudan / PySpark-Auto-Clustering
Implemented an auto-clustering tool with seed and number of clusters finder. Optimizing algorithms: Silhouette, Elbow. Clustering algorithms: k-Means, Bisecting k-Means, Gaussian Mixture. Module includes micro-macro pivoting, and dashboards displaying radius, centroids, and inertia of clusters. Used: Python, Pyspark, Matplotlib, Spark MLlib.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/berksudan%2FPySpark-Auto-Clustering
Stars: 0
Forks: 0
Open Issues: 0
License: None
Language: Python
Repo Size: 73.2 KB
Dependencies: pending
Created: about 2 years ago
Updated: about 2 years ago
Last pushed: about 2 years ago
Last synced: about 1 year ago
Topics: bisecting-kmeans, clustering, clustering-analysis, elbow-method, gaussian-mixture, kmeans-clustering, pyspark, silhouette-score, spark, spark-mllib