Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / derrickburns / generalized-kmeans-clustering
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
Stars: 284
Forks: 51
Open Issues: 3
License: apache-2.0
Language: HTML
Repo Size: 7.42 MB
Dependencies:
0
Created: almost 10 years ago
Updated: 5 months ago
Last pushed: 5 months ago
Last synced: 5 months ago
Topics: bregman-divergence, clustering, cosine-similarity, embeddings, entropy, euclidean-distance, itakura-saito-divergence, k-means, kullback-leibler-divergence, similarity-search, spark, spark-mllib
Files
No dependencies found