An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: clustering-analysis

aiaaee/Clustering-Methods-Evaluation

A collection of clustering algorithm implementations using Scikit-Learn.

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

renatocorreia-rmcm/mall-customers-segmentation

Implementation of a simple clustering model.

Language: Jupyter Notebook - Size: 824 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 4 - Forks: 1

drmukeshnitin/Single-Cell-RNA-seq-Meta-Analysis-Tool

Open-source toolkit for single-cell RNA-seq meta-analysis Tool

Language: Python - Size: 48.8 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

GuestGoodTIO/analise-de-fenotipos-com-R

🧬 Analyze health-related phenotypic data using R to uncover patterns, create predictive models, and visualize insights with interactive tools.

Language: R - Size: 35.2 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

nathadriele/analise-de-fenotipos-com-R

Este projeto inicial básico estuda análises de dados fenotípicos na área de saúde, utilizando a linguagem R. O foco está na exploração de características físicas, biomarcadores e medidas clínicas para identificar padrões e associações.

Language: R - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

DOH-JDJ0303/bigbacter-nf

Bacterial surveillance pipeline.

Language: Nextflow - Size: 6.7 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 25 - Forks: 4

EtzionR/Clustering-by-Silhouette

Optimize clustering labels using Silhouette Score.

Language: Python - Size: 23.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 2

mahaibrahim344/client-segmentation

Segment clients for e-commerce with actionable insights. Enhance marketing strategies through data-driven segmentation. 🌐📊 Explore the project and data files on GitHub.

Language: Python - Size: 8.82 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Simon-Bertrand/Clusters-Features

The Clusters-Features package allows data science users to compute high-level linear algebra operations on any type of data set. It computes approximatively 40 internal evaluation scores such as Davies-Bouldin Index, C Index, Dunn and its Generalized Indexes and many more ! Other features are also available to evaluate the clustering quality.

Language: Python - Size: 23.9 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 33 - Forks: 8

at-tan/Hierarchical_Clustering_of_Currencies 📦

A clustering exercise of global currencies on three common financial market features using data from 2017 through 2019, as published in Towards Data Science on Medium.com

Language: Jupyter Notebook - Size: 6 MB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 3

bessagroup/CRATE

CRATE: Accurate and efficient clustering-based nonlinear analysis of heterogeneous materials through computational homogenization

Language: Python - Size: 50.8 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 40 - Forks: 7

jyyulab/MICA

Mutual Information-based Non-linear Clustering Analysis

Language: Python - Size: 497 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 3

ryandewolfe33/FuzzyClusteringSimilarity.jl

Code for Dirichlet Random Models for Fuzzy Rand Adjustment

Language: Julia - Size: 2.4 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

monty-se/PINstimation

A comprehensive bundle of utilities for the estimation of probability of informed trading models: original PIN in Easley and O'Hara (1992) and Easley et al. (1996); Multilayer PIN (MPIN) in Ersan (2016); Adjusted PIN (AdjPIN) in Duarte and Young (2009); and volume-synchronized PIN (VPIN) in Easley et al. (2011, 2012). Implementations of various estimation methods suggested in the literature are included. Additional compelling features comprise posterior probabilities, an implementation of an expectation-maximization (EM) algorithm, and PIN decomposition into layers, and into bad/good components. Versatile data simulation tools, and trade classification algorithms are among the supplementary utilities. The package provides fast, compact, and precise utilities to tackle the sophisticated, error-prone, and time-consuming estimation procedure of informed trading, and this solely using the raw trade-level data.

Language: R - Size: 5.27 MB - Last synced at: 9 days ago - Pushed at: 11 months ago - Stars: 40 - Forks: 7

LatiefDataVisionary/machine-learning-basic-dicoding

Language: Jupyter Notebook - Size: 17.8 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

BayoAdejare/lightning-containers

Docker powered starter for geospatial analysis of lightning atmospheric data.

Language: Python - Size: 159 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 6 - Forks: 2

pajaskowiak/clusterConfusion

Clustering validation with ROC Curves

Language: R - Size: 1.2 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 7 - Forks: 1

Naeem1144/segmentation-project

Customer Segmentation using Machine learning models for clustering analysis

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

michael-bmstu/clustering_recomend_system

An implementation of a recommender system based on clustering anime user ratings

Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

salar96/MEP-Orthogonal-NMF

Clustering and resource allocation using Deterministic Annealing Approach and Orthogonal Non-negative Matrix Factorization O-(NMF)

Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 10 - Forks: 3

issyollie/GG4257_Spatial_Analysis_DC_Crime

Spatial Analysis of Crime and SES in Washington D.C. Completed in fulfillment of GG4257: Urban Analytics as a Toolkit for Sustainable Development.

Language: Jupyter Notebook - Size: 92.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

milaan9/Clustering_Algorithms_from_Scratch

Implementing Clustering Algorithms from scratch in MATLAB and Python

Language: Jupyter Notebook - Size: 6.5 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 201 - Forks: 179

jordanameacham/Marketing-Analysis-Fictitious-Beer-Company-

Marketing analysis on a fictitious beer company that wants to identify its target audience and develop a marketing strategy to cater to the identified audience.

Size: 282 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Fraggle460/AI-Regression-Clustering-Classification-Ethics

A series of Juypter Notebooks covering tasks related to Linear regression, clustering and classification using neural networks and 3D visualizations

Language: Python - Size: 1.51 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

anhvu2201/Churn_Users_Prediction_using_Supervised_and_Unsupervised_ML

Develop and train an supervised machine learning model to identify potential churn users. Additionally, segment these users into distinct groups using an unsupervised machine learning model to enable tailored marketing strategies.

Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ShrinivasaPH/ML-Clustering-Countries

This project clusters countries based on socio-economic factors using Gaussian Mixture Model (GMM). Input data like child mortality, income, etc., and get a prediction of whether a country is Poor Developing or Rich. The results are visualized on an interactive world map, allowing you to explore global clustering patterns.

Language: Python - Size: 3.31 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

juan-kabbali/clustering-dimention-reduction

This repository has an analysis of a dataset with 33 variables and 395 observations. The goal is to reduce dimensions and create clusters to provide interesting conclusions.

Language: R - Size: 2.98 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

zcebeci/fcvalid

Internal Validity Indexes for Fuzzy and Possibilistic Clustering

Language: R - Size: 157 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 3

berksudan/PySpark-Auto-Clustering

Implemented an auto-clustering tool with seed and number of clusters finder. Optimizing algorithms: Silhouette, Elbow. Clustering algorithms: k-Means, Bisecting k-Means, Gaussian Mixture. Module includes micro-macro pivoting, and dashboards displaying radius, centroids, and inertia of clusters. Used: Python, Pyspark, Matplotlib, Spark MLlib.

Language: Python - Size: 64.5 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

dharmik2101/Statistical-Methods

This repository is a comprehensive resource for learning and applying statistical techniques, complete with theoretical explanations, practical Python implementations, and food domain-specific applications.

Language: Jupyter Notebook - Size: 843 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

DimFragk/Centroid-clustering-app

Selection of the best centroid based clustering version with k-medoids and k-means

Language: Python - Size: 260 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Prayuganingtyas/Clustering-Model-Bank-FundFusion-Bootcamp-Myskill

This project was developed as part of my learning experience in the Data Analyst Bootcamp at Myskill, where I focused on applying Python code for Clustering Model analysis.

Language: Jupyter Notebook - Size: 968 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

AYSE-DUMAN/Clustering-by-Business-Income-and-Expenses

load and visualize data and clusters with scatter plots; prepare data for cluster analysis; perform centroid clustering with k-means; interpret clustering results and determine the optimal number of clusters for a given dataset.

Language: Jupyter Notebook - Size: 488 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

tnleite/credit-card-customer-clustering

Este repositório apresenta um projeto de segmentação e predição de clientes de cartões de crédito. Utilizando EDA, clusterização (K-Means) e machine learning, o objetivo é prever o grupo de novos clientes, apoiando estratégias de marketing personalizadas.

Language: Jupyter Notebook - Size: 7.74 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

Priyanshu501/CausalGeneAnalysis

This repository contains analysis and exploration of causal and non-causal relationships between genes and phenotypes using embeddings generated from GPT-3.5. The project applies vector analysis, dimensionality reduction, and clustering techniques (K-Means, Hierarchical, and DBSCAN) to uncover potential patterns and insights into causality.

Language: Jupyter Notebook - Size: 253 KB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

diardanoraihan/VulprioApp

A web app to detect sites' vulnerabilities and prioritize findings that need to be addressed soon

Language: Python - Size: 917 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

dilettagoglia/DataMining

🔎Data Understanding, Visualization , Preparation & Cleaning - Clustering algorithms (unsupervised learning) - Classification algorithms (supervised learning) - Sequential Pattern Mining

Language: Jupyter Notebook - Size: 94 MB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 8

AArashev/bank-marketing-prediction

"Predicting Bank Marketing Campaign Outcomes Using Classification and Regression Models. Analyze the customer segmentation with Clustering. "

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Tdelaselle/Multi-Detec

Methods for the automated detection of acoustic multiplets and their hierarchical classification according to the similarity of their emission mechanisms

Language: MATLAB - Size: 60.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

abhivali/ThesisCodes

Language: C++ - Size: 1.64 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

BayoAdejare/pipeline-ecommerce

E-commerce Data Pipeline

Language: Python - Size: 22.5 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

krishcy25/K-Means-Clustering-Unsupervised-Learning

This repository focuses on building K-Means Clustering (Unsupervised Learning algorithm) that builds the effective number of cluster grouping/segmentation based on Elbow method.

Language: Jupyter Notebook - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

daniau23/us_baby_names

Analysis of Names given at Birth

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Devanshi-Bavaria/Predictive-Modeling-for-Stock-Market-Trends

📈 Comprehensive stock price analysis, including preprocessing, clustering, correlation, and predictive modeling, to enhance investment insights and accuracy. 💡

Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

kaylaque/etongue

Processing Data of Electronic Tongue to Identify Mineral and Lead

Language: Jupyter Notebook - Size: 7.25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

shrutikapujari/R

A repository of all my projects in R.

Language: R - Size: 628 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

paschalugwu/alx-data_science-unsupervised

A comprehensive exploration of unsupervised learning techniques, including clustering and dimensionality reduction, applied to real-world data science projects.

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

AnFrBo/internet_censorship

Analysis of the State of Internet Censorship in the United Kingdom Using Data Provided by OONI and Blocked Project as well as Scraped URL Meta Data

Language: R - Size: 37.1 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

gmzmercado/MILESTONE-ShapingPolicies

This repository explores how Rappler articles shape presidential policies by analyzing dominant themes related to President Bongbong Marcos' first year in office using Latent Semantic Analysis (LSA). The study provides insights for policy-making, strategic communications, and public engagement.

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

lucasSaavedra123/dynamic_clustering

A concatenation of two GNNs to decode dynamic clustering on localization datasets

Language: Python - Size: 15.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

taissirboukrouba/Comparing-Clustering-Methods-on-Rent-Data

A Comparative report between Clustering-Based Anomaly Detection & K-means Clustering

Language: Jupyter Notebook - Size: 948 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

S84v/Customer-segmentation-clustering

Project to demonstrate various clustering algorithms for customer segmentation.

Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mpaltsai/scRNA-Explorer

scRNA-Explorer pipeline allows users to interrogate in an interactive manner scRNA-sequencing data sets to explore via gene expression correlations possible function(s) of a gene of interest

Language: R - Size: 51 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

parthnan/IowaGamblingTask-Clustering

Clustering Analysis of all available research data on the Iowa Gambling Task(list of sources in readme) using R. The Scripts produce the output for the most common archetypes among the dataset of one researcher using PCA.

Language: R - Size: 917 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

Bobidan97/Cheminformatics

Cheminformatics based project that aims to assess the diversity of the known inhibitors of SarsCov-2 proteases taken from COVID Moonshot project.

Language: Python - Size: 2.86 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

isadays/BayesianInference

The model predicts the treatment success rate for new TB cases with high accuracy and robustness. Two different approaches: PCA and Bayesian Inference. The Bayesian regression analysis reveals that c_new_sp_tsr and new_sp_fail are significant predictors of the treatment success rate, while other predictors show less certainty in their effects.

Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

karthik-d/em-clustering-sc-transcriptomics

Expectation-Maximization-based clustering algorithm to identify groups defined by biological variates as clusters in single-cell transcriptomic data.

Language: Jupyter Notebook - Size: 4.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mustafahakkoz/Classification_Clustering_Freq_Pattern_Mining

3 notebooks covering Classification, Clustering Analysis and Frequent Pattern Mining in the scope of Data Mining lectures in Marmara University.

Language: Jupyter Notebook - Size: 892 KB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

KevinMacAstro/Phenomenological-Nonlinear-Power-Spectrum-Models

Phenomenological power spectrum models for Halpha emission line galaxies from the Nancy Grace Roman Space Telescope (2023MNRAS.523.2498M)

Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

PranjalGupta2199/crime-analysis-report

Data Mining (CS F415) | Research Project

Language: Jupyter Notebook - Size: 8.92 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Evanskorir/kenya-counties

This project implements data analysis and visualization techniques to explore various indicators related to a specific domain. It includes functionality for loading, preprocessing, and analyzing the data, as well as clustering and visualizing the results.

Language: HTML - Size: 2.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KirovVerst/qparallel

Library of popular algorithms implemented in a parallel way

Language: Python - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Pravallikab29/User-Profiling-and-Segmentation-using-K-Means-Clustering

Language: Jupyter Notebook - Size: 2.74 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

anirban-code-to-live/diverse-clustering

Partitioning a set of objects into groups(clusters) of diverse objects. The aim is to maximize intra-cluster diversity while at the same time maintaining the inter-cluster similarity.

Language: Python - Size: 2.88 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

srushtii-m/Amazon-product-co-purchasing-network-analysis

Analyzing and recommending Amazon products using graph-based methods and regression models.

Language: Jupyter Notebook - Size: 6.21 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

MarinaMoreno/Client-Segmentation-Clustering

This repository contains an ML project that was approached with a business mindset from the beginning to the end. It addresses the problem of clustering.

Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

julherest/drought_clusters

Code used to identify and analyze drought clusters from gridded data.

Language: Python - Size: 301 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 10

Popseli/Customer-Segmentation-Using-Cohort-RFM-and-Clustering-Analyses

This project implements cohort, RFM and Clustering based analyses to identify various customer segments of a retail business for developing group specific marketing strategies based on customer purchasing behaviours.

Language: Jupyter Notebook - Size: 7.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nixvihari/MLTTAssignment

This is a repository consisting of my Machine Learning Tools and Techniques Assignment

Language: Jupyter Notebook - Size: 1.96 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dalitri/Clustering-Top-Spotify-Songs

Clustering Top Spotify Songs (Based on Rock Genre)

Language: Jupyter Notebook - Size: 2.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Abdul-AA/Kickstarters

Predictive Modeling and Clustering Insights for Kickstarter Success

Language: Jupyter Notebook - Size: 5.35 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SamDewriter/UserAnalytics-Telecommunication

User Analytics in the Telecommunication industry. The focus of this project is to explore a given set of Telecommunication data to analyze users experience, engagement and satisfaction.

Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

ROCeey/R-project-on-household-census

A personal project to explore Quarto functionality in R. I love💖

Language: HTML - Size: 496 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

varungopithallapelly/Client-Management-Projects

Market and Insights Analyst at the consulting services department of a multinational professional services firm. As part of this role, you are asked to work across the following three (3) different client engagement projects.

Size: 686 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

TasneemKapadia/Machine-Learning-Projects

Machine Learning Projects

Language: Jupyter Notebook - Size: 1.87 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

rezacsedu/Deep-Learning-for-Clustering-in-Bioinformatics

Deep Learning-based Clustering Approaches for Bioinformatics

Language: Jupyter Notebook - Size: 5.42 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 117 - Forks: 31

sharmaroshan/MNIST-Using-K-means

It is One of the Easiest Problems in Data Science to Detect the MNIST Numbers, Using a Classification Algorithm, Here I have used a csv File which contains the Pixels of the Numbers from 0 to 9 and we have to Classify the Numbers Accordingly. I have Used K-Means Classification Algorithm.

Language: HTML - Size: 227 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 7

ShreyaPatil1199/CLUSTERING_COUNTRIES

Dedicated to HELP International's 🌍 mission of alleviating poverty, this GitHub repository offers a Python-based project. It clusters countries by vital socio-economic factors, ensuring efficient resource allocation. Commencing with data inspection and cleaning, it identifies countries needing aid through a comprehensive analysis.

Language: Jupyter Notebook - Size: 7.99 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

olivierzach/random-neighbors

Random Neighbors: Random Forest style clustering for high-dimensional data

Language: Python - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

ArtemKovera/clust

a few different clustering algorithms with python libraries for data science

Language: Jupyter Notebook - Size: 108 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4

sidpatondikar/Capstone-Netflix-Movies-and-TV-Shows-Clustering

This project focused on analyzing a dataset of movies and TV shows from Netflix, with the main objective of text-based clustering to group similar content together. 📺📊

Language: Jupyter Notebook - Size: 147 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

barbarametzler/clusteringsatelliteimages

code for PhD thesis

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mbsuraj/Clustering_Neighborhoods_Using_FoursquareAPI

How to use clustering to understand a city and make suggestions on where to go for food / coffee / amusement etc.

Language: Jupyter Notebook - Size: 2.01 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

EtzionR/generate-Convex-Hull-SHP-from-HDBSCAN-clustering-probabilities

Defines a boundary around cluster centers in a given point-layer shapefile.

Language: Python - Size: 7.5 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

Janice-Afi/Market-Segmentation

This is a Clustering analysis on mall customers

Language: Jupyter Notebook - Size: 467 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

alasdairgm/boardgame_analysis

An analysis project on the data from my boardgame collection

Language: HTML - Size: 6.11 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

SeanFlannery/NAR-Data-Discovery

Nucleic Acids Research Data Discovery

Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

LashaGoch/Clustering-with-K-means-in-R

This project was done together with my colleague Noam Shmuel during the class of "Unsupervised Learning" @ University of Warsaw taught by professor Jacek Lewkowicz, PhD

Language: R - Size: 367 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

yongyx/Data-Analysis-on-Eating-Habits

EDA on Eating Habits dataset to uncover variables that determine obesity levels.

Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

luisosorio3214/Machine-Learning-Projects-Python-

A composition of Machine Learning Projects in python using algorithms in supervised, unsupervised, and deep learning.

Language: Jupyter Notebook - Size: 25.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

n-shenoy/Clustering-and-Segmenting-Neighborhoods-in-Toronto

A repository for Coursera's Applied Data Science Capstone Project

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

micheleandreucci/Distributed-Data-Analysis-and-Mining-Project

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

PabloJRW/super_store_projects

Clustering - Cohort Analysis - Retention Analysis

Language: Python - Size: 250 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

paocarvajal1912/Crypto_Clustering

Uses K-Means unsupervised machine learning algorithm and Principal Component Analysis to cluster cryptocurrencies based on performance in selected periods.

Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

DRLib/CDR

Implementation of CDR - Interactive Visual Cluster Analysis by Contrastive Dimensionality Reduction

Language: JavaScript - Size: 13.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 2

rohanmohapatra/hdbscan-cpp

Fast and Efficient Implementation of HDBSCAN in C++ using STL

Language: C++ - Size: 7.55 MB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 35 - Forks: 8

francaisse/DMML-CW1

K-Means Clustering & Dimensionality Reduction and Market Basket Analysis - Project Submission for Data Mining & Machine Learning Module

Language: Python - Size: 6.84 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

timothynn/Palmer-Penguins-Clustering

EDA, Clustering for Penguins dataset

Language: Jupyter Notebook - Size: 4.15 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nhthaonguyen/Customer-Clustering-KPrototypes

Use case of K-prototypes algorithm for Customer Clustering.

Language: Jupyter Notebook - Size: 1.45 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nurselinkaya/Unsupervised-Learning

Machine learning projects for utilizing unsupervised learning techniques

Language: Jupyter Notebook - Size: 142 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords
clustering-analysis 134 clustering 36 machine-learning 26 clustering-algorithm 24 python 21 hierarchical-clustering 11 data-science 11 k-means-clustering 10 unsupervised-learning 10 data-analysis 10 clustering-evaluation 10 kmeans-clustering 10 unsupervised-machine-learning 9 clustering-methods 8 machine-learning-algorithms 8 data-visualization 7 python3 7 pca-analysis 6 r 6 cluster-analysis 5 kmeans 5 marketing 5 data-mining 5 dbscan 5 dimensionality-reduction 5 classification-algorithm 5 logistic-regression 5 classification 5 customer-segmentation 4 pandas 4 deep-learning 4 ml 4 linear-regression 4 k-means 4 data-analytics 4 exploratory-data-analysis 4 segmentation 4 regression-models 4 unsupervised-clustering 3 marketing-analytics 3 kmeans-clustering-algorithm 3 sklearn 3 clustering-validation 3 jupyter 3 machinelearning 3 nlp 3 hypothesis-testing 3 jupyter-notebook 3 scikit-learn 3 rfm-analysis 3 hdbscan 3 statistics 2 cybersecurity 2 k-prototypes 2 gaussian-mixture-models 2 seaborn 2 number-of-clusters 2 recommender-system 2 data-warehouse 2 data-engineering-pipeline 2 data-engineer 2 t-sne 2 agglomerative-clustering 2 classification-model 2 numpy 2 elbow-method 2 silhouette-score 2 anomaly-detection 2 business-intelligence 2 outlier-detection 2 recommendation-system 2 cluster 2 powerbi 2 feature-engineering 2 dplyr 2 k-means-implementation-in-python 2 r-programming 2 supervised-learning 2 correlation-analysis 2 time-series-analysis 2 clustering-models 2 data-cleaning 2 eda 2 neural-networks 2 supervised-machine-learning 2 association-rule-mining 2 corrplot 2 data-cleaning-and-preprocessing 2 validation 2 data-generation 2 tableau 2 exploratory-analysis 2 modeling 2 pipeline 2 plotly 2 pca 2 randomforest 2 clusters 2 shiny 2 statistical-analysis 2