An open API service providing repository metadata for many open source software ecosystems.

Topic: "cluster-analysis"

scikit-learn-contrib/hdbscan

A high performance implementation of HDBSCAN clustering.

Language: Jupyter Notebook - Size: 27.8 MB - Last synced at: 7 days ago - Pushed at: 20 days ago - Stars: 2,908 - Forks: 515

kubesphere/kubeeye

KubeEye aims to find various problems on Kubernetes, such as application misconfiguration, unhealthy cluster components and node problems.

Language: Go - Size: 220 MB - Last synced at: about 18 hours ago - Pushed at: 26 days ago - Stars: 829 - Forks: 131

elki-project/elki

ELKI Data Mining Toolkit

Language: Java - Size: 54.9 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 808 - Forks: 325

milaan9/Clustering_Algorithms_from_Scratch

Implementing Clustering Algorithms from scratch in MATLAB and Python

Language: Jupyter Notebook - Size: 6.5 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 201 - Forks: 179

kvesta/vesta

A toolkit for static vulnerability analysis and for detecting Docker and Kubernetes cluster misconfigurations, based on real-world cloud penetration testing

Language: Go - Size: 3.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 197 - Forks: 28

erda-project/kubeprober

Large-scale Kubernetes cluster diagnostic tool.

Language: Go - Size: 268 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 140 - Forks: 39

TheJJ/ceph-balancer

Efficient Ceph placement optimization, aiming for maximum storage capacity through equal OSD utilization.

Language: Python - Size: 302 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 121 - Forks: 34

mpraski/clusters

Cluster analysis library for Golang

Language: Go - Size: 539 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 84 - Forks: 11

bkrai/Top-10-Machine-Learning-Methods-With-R

Includes the top ten must-know machine learning methods with R.

Size: 82 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 77 - Forks: 66

BinaryResearch/centrifuge-toolkit

Tool for visualizing and empirically analyzing information encoded in binary files

Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 73 - Forks: 9

gagolews/genieclust

Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R

Language: C++ - Size: 79.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 62 - Forks: 11

CI-Research/KeywordAnalysis

Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends

Size: 27.9 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 13

HaebinShin/dec-tensorflow

Tensorflow implementation of "Unsupervised Deep Embedding for Clustering Analysis"

Language: Python - Size: 29.3 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 53 - Forks: 21

lachhebo/pyclustertend

A python package to assess cluster tendency

Language: Python - Size: 6.2 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 47 - Forks: 11

Beliavsky/Burkardt-Fortran-90

Classification of John Burkardt's many Fortran 90 codes

Size: 29.9 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 46 - Forks: 10

porterehunley/RACplusplus

A high performance implementation of Reciprocal Agglomerative Clustering in C++

Language: Jupyter Notebook - Size: 191 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 2

microsoft/dstoolkit-forecasting

Template for forecasting data science projects and identifying consumption profiles in time series

Language: Jupyter Notebook - Size: 5.79 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 4

ispingos/pytheas-splitting

Home of the Pytheas software for local shear-wave splitting analysis

Language: Python - Size: 10.6 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 40 - Forks: 7

lucko515/clustering-python

Different clustering approaches applied to different problem sets

Language: Jupyter Notebook - Size: 268 KB - Last synced at: 28 days ago - Pushed at: almost 5 years ago - Stars: 39 - Forks: 57

gagolews/clustering-benchmarks

A framework for benchmarking clustering algorithms

Language: Python - Size: 194 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 38 - Forks: 6

LuisScoccola/persistable

density-based clustering for exploratory data analysis based on multi-parameter persistence

Language: Python - Size: 11.3 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 38 - Forks: 2

fanfanda/S_Dbw

S_Dbw validity index

Language: Python - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 10

philips-software/latrend

An R package for clustering longitudinal datasets in a standardized way, providing interfaces to various R packages for longitudinal clustering, and facilitating the rapid implementation and evaluation of new methods

Language: R - Size: 62.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 31 - Forks: 5

IgorWounds/Cluster-Analysis-Machine-Learning-for-Pairs-Trading

Find trading pairs with Machine Learning

Language: Jupyter Notebook - Size: 365 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 27 - Forks: 15

ear-team/bambird

Unsupervised classification to improve the quality of a bird song recording dataset. https://doi.org/10.1016/j.ecoinf.2022.101952

Language: Python - Size: 207 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 26 - Forks: 6

epigen/unsupervised_analysis

A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.

Language: Python - Size: 56 MB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 26 - Forks: 4

Beliavsky/Burkardt-Fortran-90-codes

John Burkardt's Fortran 90 codes and documentation

Language: Fortran - Size: 35.3 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 24 - Forks: 1

mlr-org/mlr3cluster

Cluster analysis for mlr3

Language: R - Size: 7.9 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 6

JoachimGoedhart/PlotTwist

PlotTwist - a web app for plotting and annotating time-series data

Language: R - Size: 3.94 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 4

debsin/dropClust

Version 2.1.0 released

Language: R - Size: 188 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 8

volfpeter/localclustering

Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.

Language: Python - Size: 2.48 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 1

gagolews/genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)

Language: C++ - Size: 409 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 3

PetoLau/CoronaDash

COVID-19 spread shiny dashboard with a forecasting model, countries' trajectories graphs, and cluster analysis tools

Language: R - Size: 3.05 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 8

MBAigner/PDFSegmenter

This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.

Language: Python - Size: 399 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 22 - Forks: 3

hlennon/LCTMtools

Latent Class Trajectory Models: An R Package

Language: R - Size: 156 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 14

RamiKrispin/ts-cluster-analysis-r

Materials for the Analyzing Time Series at Scale with Cluster Analysis in R Workshop

Language: HTML - Size: 89 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 19 - Forks: 4

hermesespinola/FOA-Kmeans-Color-Image-Segmentation

Clustering analysis using an evolutionary optimization algorithm based on nature, Forest Optimization Algorithm

Language: MATLAB - Size: 627 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 19 - Forks: 9

aryashah2k/Datalogy-Customer-Segmentation-Data-Science-Internship

A Repository Maintaining My Summer Internship Work At Datalogy As A Data Science Intern Working On Customer Segmentation Models Using Hierarchical Clustering, K-Means Clustering And Identifying Loyal Customers Based On Creation Of Recency, Frequency, Monetary (RFM) Matrix.

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: 4 days ago - Pushed at: over 4 years ago - Stars: 17 - Forks: 4

lettier/interactivekmeans

Interactive HTML canvas based implementation of k-means.

Language: JavaScript - Size: 4.64 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 2

Mthrun/FCPS

The Fundamental Clustering Problems Suite (FCPS) summarizes 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Language: HTML - Size: 5.81 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 1

m-clark/R-models

A quick reference for how to run many models in R.

Language: R - Size: 21 MB - Last synced at: 13 days ago - Pushed at: almost 7 years ago - Stars: 13 - Forks: 1

AlexandrovLab/SigProfilerClusters

Tool for analyzing the inter-mutational distances between SNV-SNV and INDEL-INDEL mutations. Tool separates mutations into clustered and non-clustered groups on a sample-dependent basis.

Language: Python - Size: 1.68 MB - Last synced at: 13 days ago - Pushed at: 27 days ago - Stars: 12 - Forks: 1

instamatic-dev/edtools

Collection of tools for automated processing and clustering of electron diffraction data

Language: Python - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 11 - Forks: 9

clusterking/clusterking

Cluster sets of histograms/curves, in particular kinematic distributions in high energy physics.

Language: Python - Size: 2.95 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 11 - Forks: 1

AiCorsair/Dataquest-Data-Science-Analysis-Projects

A repository dedicated to storing guided projects completed while learning data science concepts with Dataquest.

Language: Jupyter Notebook - Size: 74 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 11 - Forks: 3

dmattek/ARCOS

An R package to detect collective spatio-temporal phenomena

Language: R - Size: 50.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 11 - Forks: 3

BerkKilicoglu/ML-Modelling-Disease-Analysis

Obtaining meaningful results from the data set using the model trained with machine learning methods.

Language: Python - Size: 5.14 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

philips-labs/demo-clustering-longitudinal-data

Supplementary materials for the manuscript "Clustering of longitudinal data: A tutorial on a variety of approaches" by N. G. P. Den Teuling, S.C. Pauws, and E.R. van den Heuvel (2021)

Language: R - Size: 18.6 KB - Last synced at: 13 days ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 9

uef-machine-learning/fastdp

Fast variant of Density Peaks clustering

Language: C - Size: 21 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

NicolasH2/ggdendroplot

dendrograms in ggplot2.

Language: R - Size: 626 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 0

alashkov83/S_Dbw

S_Dbw validity index. Adapted for DBSCAN (and similar)

Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 5

PetoLau/ClusterForecast

Clustering-based Forecasting Method for Individual End-consumer Electricity Consumption Using Smart Grid Data

Language: R - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 10 - Forks: 5

Devinterview-io/cluster-analysis-interview-questions

🟣 Cluster Analysis interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.

Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1

jayelm/ca-parkinsons

Cluster analysis of Parkinson's disease

Language: R - Size: 82.4 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 9 - Forks: 6

benjaminirving/perfusion-slic

Simple Linear Iterative Clustering adapted for 4D DCE-MRI or other perfusion imaging

Language: Python - Size: 271 KB - Last synced at: almost 2 years ago - Pushed at: almost 9 years ago - Stars: 9 - Forks: 1

eren-ck/finch

A Python implementation of "FINCH Clustering Algorithm (CVPR 2019)"

Language: Python - Size: 460 KB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 2

ScialdoneLab/CIARA

CIARA (Cluster Independent Algorithm for the identification of markers of RAre cell types) is an R package that identifies potential markers of rare cell types looking at genes whose expression is confined in small regions of the expression space

Language: R - Size: 76.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

albertopessia/Kpax3.jl

Bayesian bi-clustering of categorical data

Language: Julia - Size: 530 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 7

skacem/Strategic-Business-Analytics-Foundations

Foundations of strategic business analytics (in Python) by ESSEC Business School

Language: Jupyter Notebook - Size: 2.36 MB - Last synced at: 2 days ago - Pushed at: almost 4 years ago - Stars: 8 - Forks: 4

nunofachada/amvidc

Data clustering algorithm based on agglomerative hierarchical clustering (AHC) which uses minimum volume increase (MVI) and minimum direction change (MDC) clustering criteria.

Language: Matlab - Size: 131 KB - Last synced at: about 1 month ago - Pushed at: over 9 years ago - Stars: 8 - Forks: 4

emso-exe/Analise_de_rh_-_people_analytics

People analytics project using machine learning to cluster data on employees who may leave the company.

Language: Jupyter Notebook - Size: 28.4 MB - Last synced at: 29 days ago - Pushed at: 8 months ago - Stars: 7 - Forks: 2

Aidenzich/HelloBERTopic

BERTopic usage examples in Chinese

Language: Jupyter Notebook - Size: 6.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

sidharth178/The-Battle-of-Neighborhoods-Capstone-Project

The objective of this project is to find the best place or neighbourhood in Toronto to open a restaurant or startup using Foursquare location data.

Language: Python - Size: 27.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 5

abjur/tjsp_app

Shiny app that performs cluster analysis of some productivity measures from TJSP

Language: R - Size: 23.8 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 0

Andreoliveira85/My-data-machine-learning-portfolio

Portfolio with data science and machine learning projects I developed during my training in data science.

Size: 8.87 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

Liang-Team/Sequenzo

A fast, scalable, and intuitive Python package in social sequence analysis.

Language: Jupyter Notebook - Size: 50.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 1

uef-machine-learning/Balanced_k-Means_Revisited

Balanced k-Means Revisited algorithm

Language: C - Size: 6.68 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

V-MalM/Stock-Clustering-and-Prediction (fork of dschoen24/Stock-Prediction)

To build, train and test an LSTM model to forecast the next day's 'Close' price, and to create diverse stock portfolios using k-means clustering to detect patterns in stocks that move similarly with an underlying trend, i.e., for a given period, how stocks trend together. To deploy our findings to an app along with an interactive dashboard to predict the next day's 'Close' for any given stock.

Language: Jupyter Notebook - Size: 58.1 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

nejci/Pepelka

Pepelka is a MATLAB toolbox for data clustering and visualization.

Language: MATLAB - Size: 38.9 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 0

acabassi/coca

R package for COCA: Cluster-of-Clusters Analysis

Language: R - Size: 2.28 MB - Last synced at: 25 days ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 2

rtimbro185/syr_mads_mar653_marketing_analytics

Syracuse University, Masters of Applied Data Science - MAR 653 Marketing Analytics

Language: HTML - Size: 50.8 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 2

eXascaleInfolab/daoc

DAOC (Deterministic and Agglomerative Overlapping Clustering algorithm): Stable Clustering of Large Networks

Language: C++ - Size: 117 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

PetoLau/UnsupervisedEnsembles

Unsupervised ensemble learning methods for time series forecasting. Bootstrap aggregating (bagging) for double-seasonal time series forecasting and its ensembles.

Language: R - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 4

aniket1992/High-Profile-Doctor-Segmentation

Segmenting high-profile doctors for a pharma company to maximise returns.

Language: Jupyter Notebook - Size: 2.27 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 8

adanjoga/cvik-toolbox

CVIK is a Toolbox for the automatic determination of the number of clusters on data clustering problems

Language: MATLAB - Size: 4.87 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 2

nafisalawalidris/911-Call-Analysis

The 911 Call Analysis project explores and visualises emergency call data to uncover patterns and trends. It includes data preparation, exploratory analysis, visualizing call volume and reasons and generating heatmaps. Users can customize the code for their dataset. The project relies on libraries like Pandas, NumPy, Matplotlib, Seaborn, and SciPy

Language: Jupyter Notebook - Size: 24.1 MB - Last synced at: 23 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

acabassi/klic

R package for KLIC: Kernel Learning Integrative Clustering

Language: R - Size: 22.4 MB - Last synced at: 12 days ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 3

c3duan/Time-Series-Classifier

Anomaly Classification in Time Series Data

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

myntra/Analyse-Redis-Cluster-nodes

Tired of analysing a Redis cluster using the `cluster nodes` command? Try using this simple shell script.

Language: Shell - Size: 1.38 MB - Last synced at: 28 days ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 3

HawxChen/CloudComputing

MapReduce, Spark, Hadoop, PostgreSQL, Cluster Management

Language: Python - Size: 54.7 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 0

mike-liuliu/Min-Max-Jump-distance

Source code of the paper "Min-Max-Jump distance and its applications."

Language: Jupyter Notebook - Size: 104 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

egy1st/denmune-clustering-algorithm (fork of scikit-learn-contrib/denmune-clustering-algorithm)

DenMune is a clustering algorithm that can find clusters of arbitrary size, shape and density in two dimensions. Higher dimensions are first reduced to 2-D using t-SNE. The algorithm relies on a single parameter K (the number of nearest neighbors). The results show the superiority of DenMune. Enjoy the simplicity and the power of DenMune.

Language: Jupyter Notebook - Size: 73.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

pnavaro/GeometricClusterAnalysis.jl

Geometric methods for Cluster Analysis

Language: Julia - Size: 84.3 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

saskiakutz/BaClAva

GUI for Bayesian cluster analysis of SMLM data

Language: R - Size: 4.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

VarIr/copac

COPAC clustering

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

JIgor08/Cust_Seg_Project

Clustering project using RFV

Language: Jupyter Notebook - Size: 29.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

nredell/RARI

A python package which implements a distance-based extension of the adjusted Rand index for the supervised validation of 2 cluster analysis solutions

Language: Python - Size: 368 KB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

kaburelabs/bbb-twitter-monitor

Social Media Analysis

Language: Python - Size: 80.4 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

jirotubuyaki/ThunderBayes.jl

A Julia Package for Bayesian Nonparametric Analysis for Machine Learning

Language: Julia - Size: 278 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

pedrohasselmann/GmodeClass

Adapted G-mode Clustering method for Python 2.7 using Numpy, Scipy and Matplotlib.

Language: Python - Size: 1.12 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

ArtemKovera/clust

a few different clustering algorithms with python libraries for data science

Language: Jupyter Notebook - Size: 108 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 4

EricPostMaster/Halloween-Candy-Power-Ranking-Cluster-Analysis

This analysis uses Principal Components Analysis and k-Means clustering to identify useful subgroups within FiveThirtyEight's Ultimate Halloween Candy Power Ranking dataset. It also uses hypothesis tests to assess whether differences exist between popular and unpopular candies.

Language: R - Size: 4.33 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

vishalv91/Customer-Analytics

The project concerns an international e-commerce company based in the USA that wants to discover key insights from its customer database. The company wants to use some of the most advanced machine learning techniques to study its customers.

Language: R - Size: 1.34 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 5

clusterfreak/ClusterCoreSwift

Core classes for cluster analysis - Swift - Fuzzy-C-Means and Possibilistic-C-Means Algorithms based on the Java Version of ClusterCore

Language: Swift - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

prathmachowksey/Hopkins-Statistic-Clustering-Tendency

A python implementation for computing the Hopkins' statistic (Lawson and Jurs 1990) for measuring clustering tendency of data

Language: Jupyter Notebook - Size: 176 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 4

eXascaleInfolab/daor

DAOR Parameter-free Embedding Framework for Large Graphs (Networks)

Language: C++ - Size: 487 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2

sharmaroshan/Text-Clustering

It is a very different task, as here I am going to cluster 200 different texts related to games and sports into 2 or more different clusters. We can also use a Zipf plot to determine how many useful clusters can be formed.

Language: Jupyter Notebook - Size: 495 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 5

tjkemp/likert-clusters 📦

A Jupyter notebook for Likert data cluster analysis and visualization of Finland's parliamentary election 2015

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

PetoLau/ClipStream

ClipStream - multiple data streams clustering method

Language: R - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 2

uef-machine-learning/tspgclu

Fast but accurate approximation of Ward's agglomerative clustering using a fully connected TSP graph

Language: C - Size: 8.18 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Related Topics
clustering (132), machine-learning (88), clustering-algorithm (85), python (80), data-science (58), r (57), data-visualization (43), data-analysis (37), kmeans-clustering (31), k-means-clustering (30), machine-learning-algorithms (28), unsupervised-learning (28), cluster (24), data-mining (24), clustering-evaluation (23), unsupervised-machine-learning (20), pca (19), clustering-methods (19), visualization (19), k-means (18), exploratory-data-analysis (18), logistic-regression (15), statistics (14), pandas (14), segmentation (14), hierarchical-clustering (14), time-series-analysis (13), time-series (13), pca-analysis (12), random-forest (12), customer-segmentation (12), jupyter-notebook (12), principal-component-analysis (12), kmeans (12), dimensionality-reduction (12), numpy (11), nlp (11), scikit-learn (11), python3 (10), deep-learning (10), classification (9), regression-analysis (9), dbscan (9), text-mining (9), ggplot2 (9), seaborn (8), covid-19 (8), supervised-learning (8), matplotlib (8), factor-analysis (8), rstudio (8), regression-models (8), dbscan-clustering (7), eda (7), sklearn (7), decision-trees (7), clusters (6), linear-regression (6), python-3 (6), feature-selection (6), knn (6), nlp-machine-learning (6), knn-classification (6), association-rules (6), outlier-detection (6), data-cleaning (6), forecasting (6), random-forest-classifier (5), outliers (5), gaussian-mixture-models (5), dendrogram (5), clustering-algorithms (5), anomaly-detection (5), genomics (5), spark (5), ensemble-learning (5), webscraping (5), feature-engineering (5), umap (5), marketing (5), clustering-analysis (5), ml (5), analysis (5), shiny (5), multivariate-analysis (5), excel (4), feature-extraction (4), java (4), bioinformatics (4), data-analytics (4), datasets (4), hadoop (4), predictive-modeling (4), data (4), artificial-intelligence (4), data-preprocessing (4), recommendation-system (4), cluster-validity-index (4), xgboost (4), network-analysis (4)