Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / berksudan / PySpark-Auto-Clustering

An auto-clustering tool that finds a suitable seed and number of clusters. Optimization methods: Silhouette, Elbow. Clustering algorithms: k-Means, Bisecting k-Means, Gaussian Mixture. The module includes micro-macro pivoting and dashboards displaying the radius, centroids, and inertia of clusters. Built with Python, PySpark, Matplotlib, and Spark MLlib.
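
The idea of searching for a seed and a number of clusters with the Silhouette method can be illustrated with a minimal sketch. This is not the repository's code and does not use Spark MLlib: it is a plain-Python stand-in built on a small Lloyd's k-means, and the names `kmeans`, `silhouette`, and `best_k_and_seed` are hypothetical, chosen only for illustration.

```python
import math
import random


def kmeans(points, k, seed, iterations=20):
    """Plain Lloyd's k-means (illustrative only). Returns (labels, centroids, inertia)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # the seed controls the initial centroids

    def assign():
        # Label each point with the index of its nearest centroid.
        return [min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points]

    for _ in range(iterations):
        labels = assign()
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    labels = assign()
    # Inertia: sum of squared distances from each point to its centroid.
    inertia = sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))
    return labels, centroids, inertia


def silhouette(points, labels, k):
    """Mean silhouette coefficient; points in singleton clusters contribute 0."""
    clusters = [[p for p, lab in zip(points, labels) if lab == c] for c in range(k)]
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        others = [grp for c, grp in enumerate(clusters) if c != lab and grp]
        if len(own) < 2 or not others:
            continue
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)  # cohesion
        b = min(sum(math.dist(p, q) for q in grp) / len(grp) for grp in others)  # separation
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def best_k_and_seed(points, k_values, seeds):
    """Try every (k, seed) pair and keep the one with the highest silhouette."""
    best = None
    for k in k_values:
        for seed in seeds:
            labels, _, _ = kmeans(points, k, seed)
            score = silhouette(points, labels, k)
            if best is None or score > best[0]:
                best = (score, k, seed)
    return best
```

The Elbow method would instead compare the `inertia` values returned by `kmeans` across increasing `k` and look for the point where the decrease levels off.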

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/berksudan%2FPySpark-Auto-Clustering
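
Note that the repository's full name is percent-encoded in the URL above (`/` becomes `%2F`). A minimal sketch of building such a URL follows; the path pattern is inferred only from this one URL, and the helper name `repository_api_url` is hypothetical.

```python
from urllib.parse import quote

# Assumption: the base path is taken from the single URL shown on this page.
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repository_api_url(host, full_name):
    """Build the JSON API URL for a repository, encoding '/' in the full name."""
    # safe="" makes quote() encode "/" as "%2F" instead of leaving it as a path separator.
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


print(repository_api_url("GitHub", "berksudan/PySpark-Auto-Clustering"))
```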

Stars: 0
Forks: 0
Open Issues: 0

License: None
Language: Python
Repo Size: 73.2 KB
Dependencies: pending

Created: about 2 years ago
Updated: about 2 years ago
Last pushed: about 2 years ago
Last synced: about 1 year ago

Topics: bisecting-kmeans, clustering, clustering-analysis, elbow-method, gaussian-mixture, kmeans-clustering, pyspark, silhouette-score, spark, spark-mllib
