An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hierarchical-clustering

ayushi-p/webscrapping-indeed

This project automates the extraction of job postings from Indeed using web scraping techniques. It gathers structured job data, including job titles, company names, locations, salaries, and job descriptions, to provide insights into hiring trends, salary benchmarks, and skill demand across industries.

Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

geoav74/Data_Scientist_Salaries_in_EUR_2025

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

AniK4111/Netflix_Movies_And_TV_Shows_Clustering

Unsupervised Machine Learning project for Netflix Movies and TV Shows Clustering. The main goal of this project is to create a content-based recommender system that recommends top 10 shows to users based on their viewing history.

Size: 2.58 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

pamudu123/seeds_clustering

Machine Learning Clustering

Language: Jupyter Notebook - Size: 1.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

AdityaSreevatsaK/DS-ML-Playground

A collection of data science and machine learning projects showcasing complete workflows, from data cleaning and preprocessing to model building and evaluation. Dive into diverse datasets, explore a range of techniques, and experiment with models in this comprehensive playground for learning and innovation.

Language: Jupyter Notebook - Size: 40.2 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

beginnerSC/pyminimax

Python implementation of minimax-linkage hierarchical clustering

Language: Python - Size: 444 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

taneishi/LeukemiaClustering

Hierarchical Clustering of Leukemia Gene Expression Dataset

Language: Python - Size: 106 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

pngo1997/Chicago-Airbnb-Hybrid-Recommender-System

Develops a hybrid recommender system for Chicago Airbnb listings using data from Inside Airbnb.

Language: Jupyter Notebook - Size: 30.9 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

gabrielramirezv/protein_clustering

The ABC transporter family is a group of proteins that are involved in the transport of various molecules across the cell membrane. We aim to to cluster the proteins of the ABC transporter family into groups based on their sequence similarity.

Language: R - Size: 3.19 MB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

Tolumie/Stock-Market-Clustering-and-Predictive-Analysis_-Uncovering-Investment-Insights

Stock Market Clustering & Predictive Analysis | Leverage PCA & DBSCAN, K-Means, Hierarchical Clustering to uncover investment insights. Identify market segments, high-risk outliers (NVDA, TSLA, NFLX), and portfolio optimization strategies using S&P 500 data.

Language: Jupyter Notebook - Size: 236 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

aye-nyeinSan/NLP_Workshop5

The national Anthems of our world with K-Mean and HAC

Language: Jupyter Notebook - Size: 220 KB - Last synced at: 28 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

bgr8/Makine-ogrenmesi

Machine Learning with Python

Language: Jupyter Notebook - Size: 3.03 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

firatolcum/Machine_Learning_Course

This repository contains the Machine Learning lessons I took from the Clarusway Bootcamp between 10 Aug - 14 Sep 2022 and includes 17 sessions, 5 labs, 4 case studies, 5 weekly agendas, and 3 projects.

Language: Jupyter Notebook - Size: 237 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

drod-96/efficient_clustering

Efficient clustering approch to identify optimal heat consumers clusters within the DHNs

Language: Python - Size: 11.7 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

shubhamjha97/hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Language: Python - Size: 2.28 MB - Last synced at: 8 months ago - Pushed at: almost 5 years ago - Stars: 83 - Forks: 31

data-liangai/hierarchical-clustering

This is a hierarchical clustering project of viral protein sequences. For details, please refer to https://doi.org/10.46793/match.90-2.381A

Language: R - Size: 344 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Vidhi1290/Malware-Detection

Welcome to the Malicious Executable Detection project! This repository explores the world of machine learning and clustering analysis to detect malicious executable files 🔥🔐

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

AiCorsair/Python-Case-Study-365-Data-Science-Customer-Segmentation-in-Marketing

This repository contains a detailed case study on the segmentation of 365 Data Science customers using real-world data from an onboarding survey.

Language: Jupyter Notebook - Size: 425 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

fazeelibtesam/Brain_Tumor

Classification of Brain Tumor

Language: Jupyter Notebook - Size: 577 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

dpruthardt/UnsupervisedLearning

Language: HTML - Size: 3.46 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

iesl/xcluster

Algorithms and evaluation tools for extreme clustering

Language: Scala - Size: 8.95 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 72 - Forks: 21

raj1603chdry/CSE3020-Web-Mining-Labs

Repository containing all the codes created for the lab sessions of CSE3020 Web Mining at VIT University Chennai Campus

Language: Python - Size: 13.8 MB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 24 - Forks: 19

subhendughosh91/HELP-Humanitarian-NGO-Data-K-Means-and-Hierarchical-Clustering

Clustering countries from an NGO Data to get Top 10 countries who are in dire need of Aid based on their Socio Economic Condition

Language: Jupyter Notebook - Size: 517 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

wei9935/wasserstein_HRP

This trial explores enhancing HRP portfolio optimization by using sliced-Wasserstein distance for clustering. Results are preliminary and may not outperform the original method, but further research could improve outcomes.

Language: Python - Size: 1.34 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

QuantLet/HClustering Fork of lizzzi111/HClustering

Language: Jupyter Notebook - Size: 30.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 3

cmdevries/LMW-tree

Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering

Language: C++ - Size: 74.5 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 74 - Forks: 20

AbbasPak/Clustering-Methods-Examples

This repository aims to provide an overview of various clustering methods, along with practical examples and implementations.

Language: Jupyter Notebook - Size: 1.59 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 5 - Forks: 1

wolny/phash-hierarchical-clustering

Hierarchical clustering of images using phash and Hamming distance

Language: Scala - Size: 1.37 MB - Last synced at: about 2 months ago - Pushed at: over 8 years ago - Stars: 8 - Forks: 3

OCHOLA-EDDYPHIL/Clustering

This project performs hierarchical clustering on a dataset containing network usage and performance metrics. It includes data preprocessing, encoding, normalization, and visualization of clustering results using dendrograms. The purpose is to analyze and group similar data points, offering insights into patterns and relationships within the dataset

Language: Python - Size: 19.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

weihan-zhao/knotweed_fieldwork

The documents under this repository are the codes for analysis of knotweed population performance across ranges and environmental drivers of trait variation

Language: R - Size: 82 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

jonghough/jlearn

Machine Learning Library, written in J

Language: J - Size: 9.53 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 58 - Forks: 13

Armin-Abdollahi/Machine-Learning

Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

mariamffatima/Machine-Learning-Tasks

This repository contains a collection of lab tasks, assignments, and projects designed to learn and practice key concepts in Machine Learning. It includes hands-on Jupyter notebooks covering fundamental ML techniques, real-world projects, and theoretical exercises. Ideal for students and enthusiasts aiming to deepen their understanding of ML.

Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

teja-1403/Coursera-Machine-Learning-with-Python-Honors

This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

theo-liang/Python-Project-Analysis-for-ClimateWins

This project analyzes historical weather data to identify patterns and predict future weather conditions, focusing on extreme events and temperature trends across Europe.

Language: Jupyter Notebook - Size: 76.1 MB - Last synced at: 8 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mrunmaim16/CSE-5334-Programming-Assignments

Programming assignments completed for course CSE - 5334 Data Mining under Professor Dr. Marnim Galib.

Language: Jupyter Notebook - Size: 460 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

iesl/expLinkage

Supervised hierarchical clustering

Language: Python - Size: 102 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 6

felipeversiane/face-cluster

application that receives a dataset of faces and creates a cluster of images that have similarity.

Language: Python - Size: 555 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

subhashpolisetti/Clustering-Techniques-and-Embeddings

This repository includes Colab notebooks demonstrating various clustering algorithms, from scratch-based methods to advanced deep learning models and embeddings. Each notebook features explanations, visualizations, and quality evaluation metrics for clustering performance.

Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

palVikram/Machine-Learning-using-Python

Regression, Classification and Clustering

Language: Python - Size: 223 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 19

Gemois/HCvision

This AutoML web application, built with Spring Boot, Python, and Angular, performs Agglomerative Hierarchical Clustering on user-uploaded datasets. It automatically determines the optimal number of clusters, proximity method and provides clustering results.

Language: Java - Size: 1.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Aptacode/PathFinder

An optimized C# HPA* Pathfinder

Language: C# - Size: 24.7 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 0

GiatrasKon/Hyperspectral-Image-Clustering

Analysis of the Salinas hyperspectral image dataset using advanced clustering algorithms, focusing on identifying homogeneous regions in the image. Implementations of cost-function optimization and hierarchical clustering techniques, along with evaluations and visualizations in reduced-dimensional spaces.

Language: MATLAB - Size: 9.22 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MoinDalvs/Assignment_East-West_Airlines

Problem Statement Perform clustering (Hierarchical,K means clustering and DBSCAN) for the airlines data to obtain optimum number of clusters

Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 3

Priyanshu501/CausalGeneAnalysis

This repository contains analysis and exploration of causal and non-causal relationships between genes and phenotypes using embeddings generated from GPT-3.5. The project applies vector analysis, dimensionality reduction, and clustering techniques (K-Means, Hierarchical, and DBSCAN) to uncover potential patterns and insights into causality.

Language: Jupyter Notebook - Size: 253 KB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

KasrAskari/Cars-Models

Clustering of Cars Models

Language: Jupyter Notebook - Size: 1 MB - Last synced at: 9 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

raysas/ML-from-scratch

a machine learning python library implementation from scratch

Language: Jupyter Notebook - Size: 670 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

RobCyberLab/Image-Pixel-Clustering

🌀Image Pixel Clustering📏

Language: Python - Size: 2.52 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

akhileshthite/clustering

Mall Customers

Language: Jupyter Notebook - Size: 517 KB - Last synced at: 8 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

prneidhardt/Unsupervised-Learning

Trade & Ahead Project

Language: Jupyter Notebook - Size: 4.22 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

anna-kay/Bank-Marketing-Dataset-classification-clustering

Bank Marketing Dataset (BMD), Machine Learning Repository (UCI), classification, clustering, R

Language: R - Size: 9.77 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

juan-gamero-salinas/climateready-survey-pamplona

This repository gives you access to the CLIMATEREADY survey dataset containing thermal comfort votes during the 2021 and 2022 heatwave periods in Pamplona, Spain, as well as other relevant parameters self-reported by surveyees (e.g. occupant characteristics and behaviour, key building/dwelling characteristics, sleep problems, heat-related symptoms)

Language: HTML - Size: 1.52 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Aruuunn/clustering-visualizer

Clustering Visualizer is a Web Application for visualizing popular Machine Learning Clustering Algorithms (K-Means, DBSCAN, Mean Shift, etc.).

Language: TypeScript - Size: 4.18 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

SirineMaaroufi/ML_Clustering_Explorations

This repository contains a series of notebooks exploring various clustering techniques in machine learning.

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jeffreywijaya100/exercise-ml

solving case and answer question given about machine learning

Language: Jupyter Notebook - Size: 390 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

starkblaze01/Artificial-Intelligence-Codes

Collection of Artificial Intelligence Algorithms implemented on various problems

Language: Jupyter Notebook - Size: 5.88 MB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 41 - Forks: 9

walidbosso/R_Data_mining

Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees

Language: R - Size: 9.75 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

JaewonSon37/Fundamentals_of_Data_Science1

Language: R - Size: 780 KB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KOLANICH-mirrors/PyChem

A mirror and a fork of PyChem

Language: Python - Size: 21.7 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

Niteshchawla/Clustering-ML

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences. Allowing Scaler to make tailored content recommendations and provide specialized mentorship.

Language: Jupyter Notebook - Size: 12.7 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

cambiotraining/stats-mva

Materials supporting the Multivariate analysis sessions

Language: Python - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

shlienlab/raccoon

Multi-scale clustering in Python

Language: Python - Size: 6.14 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 4

robertfmath/Asset-Class-Hierarchical-Clustering

Hierarchically clustering major asset classes in the investment landscape

Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

rdebullain/ML-Wholesale_Data_Analysis

Language: Jupyter Notebook - Size: 9.34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

GabrielMazzotta/NLP-Clustering--Movie-Similarity-from-Plot-Summaries

A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

div-lab/dendromap

Interactively and visually explore large-scale image datasets used in machine learning using treemaps. VIS 2022

Language: Jupyter Notebook - Size: 167 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 48 - Forks: 1

singhsourav0/Machine-Learning-Algorithms

Explore a broad range of machine learning algorithms, including ML, RF, SVM, LR, NB, PCA, LogReg, DT, KMeans, SVMC, GD, HClust, DBSCAN, ICA, KNN, and more, within this repository. Gain practical insights and apply these diverse ML concepts effectively.

Language: Jupyter Notebook - Size: 4.48 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

fjovanovic/Twitter-AI-Sentiment-Analysis

A project leveraging classification and clustering algorithms to determine the sentiment of tweets, distinguishing between hatred and non-hatred content

Language: Jupyter Notebook - Size: 7.42 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SirTee12/UK-Retail-Segmentation-Analysis

Consequently, the main purpose of this study is to develop a systematic implementation of customer segmentation for the business. To distinguish diverse customers, customers’ behavioral characteristics are obtained from the RFM model (Recency, Frequency, Monetary Value).

Language: Jupyter Notebook - Size: 21.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Observarun/Hierarchical-path-segmentation-II

Codes for segmenting and clustering of relocation time series data to generate StaMEs and CAMs, coding of raw CAMs with StaMEs as bases, CAM rectification, and comparison of coding schemes, as performed in https://doi.org/10.1101/2024.08.02.606194.

Language: R - Size: 61.5 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Matt-Alaei/credit-card-holder-clustering

Implementation of five different clustering algorithms

Language: Jupyter Notebook - Size: 722 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ClimSocAna/sentiments-with-hierarchical-clustering

Code for the paper "Guiding sentiment analysis with hierarchical text clustering: Analyzing the German X/Twitter conversation on face masks in the 2020 COVID-19 pandemic"

Language: HTML - Size: 1.24 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ranceforhiwd/heir-clustering

Hierarchical Clustering

Language: Python - Size: 112 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cego669/DirtyCategoriesEncoding

Repository containing two classes (StringAgglomerativeEncoder and StringDistanceEncoder) useful for grouping or visualizing the distance between dirty categorical variables. They are compatible with the scikit-learn API.

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mcxraider/sph-timeline-project

SPH media prototype

Language: Python - Size: 245 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mpfi-dsp/GoldInAndOut

🔬🥇🧠🍔 Automated Gold Particle Analysis For Freeze Fracture Replica Electron Microscopy Images

Language: Python - Size: 330 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 3

pandafengye/MIST

MIST: a metagenomic intra-species typing tool.

Language: Python - Size: 251 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

KUSHALKUMARD/Customer-Segmentation-Project

In this project, we will first firstly implement RFM Analysis to group customers according to RFM metrics and then the same customers will be segmented by using K-Means and Hierarchical Clustering algortihms.

Language: Python - Size: 177 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

krunal-nagda/HELP-International-Case-Study---Clustering

Using K-means and Hierarchical Clustering to categorise the countries using some socio-economic and health factors that determine the overall development of the country. Then suggesting the countries which the CEO needs to focus on the most.

Language: Jupyter Notebook - Size: 437 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Priyanshu501/Customer_Segmentation_for_Targeted_Marketing

Analysis of customer information to identify potential segments for targeted marketing campaigns using various clustering techniques.

Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JessicaaaJe/User-Behavior-Analysis-Using-Clustering

Exploring customer loyalty and behavior using clustering techniques on a churn-risk dataset,with insights into demographics, engagement, churn risk, and promotional strategy effectiveness

Language: Jupyter Notebook - Size: 956 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

PerretB/ultrametric-fitting

Ultrametric Fitting by Gradient Descent

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 3

ggeop/Flag-Study

Nations Flags Classification & Clustering project. :flags:

Language: R - Size: 2.52 MB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 1

mqwfrog/MHCCL

MHCCL: Masked Hierarchical Cluster-wise Contrastive Learning for Multivariate Time Series - a PyTorch Version (AAAI-2023)

Language: Python - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 10

SaniyaAbushakimova/Brewing-Insights-with-Unsupervised-Learning

Conducted a comprehensive clustering analysis to categorize beers based on features such as Astringency, Alcohol content, Bitterness, Sourness, and more. Utilized k-medoids and hierarchical agglomerative clustering algorithms to achieve this classification. Tech: Python (numpy, pandas, seaborn, matplotlib, sklearn, scipy)

Language: Jupyter Notebook - Size: 50 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

riddhigupta1110/DataWarehousingAndMining-V-MU-CSE

Codes for Practical experiments of Data Warehousing and Mining (Semester V - Computer Engineering - Mumbai University)

Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pharo-ai/hierarchical-clustering

Hierarchical Clustering algorithms for Pharo

Language: Smalltalk - Size: 63.5 KB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

srosalino/Clustering_On_Starbucks_Beverages

Grouping of drinks according to their nutritional values, making it easier to categorize them in a future catalog, increasing organization and facilitating the search depending on individual preferences

Language: Jupyter Notebook - Size: 5.95 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Bauti2020/Statistical-and-Machine-Learning-Approaches-for-Portfolio-Optimization

Six portfolio optimization strategies were considered, plus one benchmark, across 3 scenarios,. We considered methods relying both in ML and common statistical procedures; and we run an out-of-sample back-test for each strategy, for every scenario.

Language: Jupyter Notebook - Size: 9.15 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

vhtua/Group4_Data_analysis

Hierarchical Cluster Analysis: Movie Genres Preferences

Language: TeX - Size: 24.2 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Keerthiga-Sekar/Segmentation-using-Clustering-and-PCA-

1. Digital Marketing Advertisement Data Segmentation using clustering techniques. 2. Identify Optimum Principal Components that explains the most variance in the Primary Census data.

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

karthik-d/image-clef-medical-gan-2023

Scripts, figures, and working notes for the participation in ImageCLEFmedical GANs task, part of the 14th CLEF Conference, 2023.

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

esoto15/text-tone-preference-analysis

This repository contains the code and analysis for a research project aimed at enhancing social media impact for food safety security organizations. The project focuses on understanding text tone preferences based on user demographics using machine learning techniques.

Language: Jupyter Notebook - Size: 25.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SamKazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

fairuznajla/Eco-Taxi-Analytics

This team project is consists recommendation deck and monitoring dashboard for an Eco-Friendly Online Taxi Company by using K-Means and Hierarchical Clustering of CO2 emissions levels from various car brands.

Language: Jupyter Notebook - Size: 4.33 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hugohiraoka/Credit_Card_Customer_Segmentation

Classification Model of Potential Credit Card Customers

Language: HTML - Size: 12.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kumaranjalij/Flora-Genie

Flora Genie is a personalized plant recommendation system designed to help amateur gardeners select the most suitable plants for their homes or gardens.

Language: Jupyter Notebook - Size: 227 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shreyansh-2003/Hands-On-With-Machine-Learning-Algorithms

This repository contains a collection of labs that explore various machine learning algorithms and techniques. Each lab focuses on a specific topic and provides detailed explanations, code examples, and analysis. The labs cover clustering, classification and regression algos, hyperparameter tuning, data-preprocessing and various evaluation metrics.

Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

tboudart/Tanzanian-Water-Pumps-Clustering-and-Classification

For this group project, I performed cluster analysis and classification using Python to predict one of three classes for water pumps; functional, functional but needs repair, and non-functions. I used clustering to find hidden data structures to exploit for fitting individual classification techniques with better results than using the entire dataset. Unfortunately, k-means clustering, DBSCAN, hierarchical clustering, nor OPTICS produced well-defined clusters. The entire dataset was therefore used for fitting classification algorithms. The two classification techniques I was responsible for were k-nearest neighbors and stacked generalization ensemble. For the latter, I combined the best models each group member developed. All the models had a hard time predicting the functional but need repair class. My best model was only able to achieve an accuracy of 76%.

Language: Jupyter Notebook - Size: 7.61 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sdq/deepvis

machine learning algorithms in Swift

Language: Swift - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 56 - Forks: 11

Related Keywords
hierarchical-clustering 650 clustering 214 kmeans-clustering 166 machine-learning 151 python 130 k-means-clustering 117 unsupervised-learning 74 data-science 73 pca 67 clustering-algorithm 60 dbscan-clustering 60 logistic-regression 52 r 42 linear-regression 42 agglomerative-clustering 42 k-means 42 data-visualization 37 dendrogram 36 dbscan 36 unsupervised-machine-learning 36 kmeans 34 decision-trees 34 machine-learning-algorithms 33 random-forest 32 principal-component-analysis 30 pca-analysis 29 data-analysis 27 scikit-learn 27 pandas 27 visualization 27 python3 26 classification 26 gaussian-mixture-models 26 data-mining 24 exploratory-data-analysis 24 jupyter-notebook 23 knn-classification 23 knn 22 silhouette-score 22 numpy 20 matplotlib 19 polynomial-regression 18 dimensionality-reduction 18 sklearn 18 scipy 17 naive-bayes-classifier 17 customer-segmentation 17 svm 17 deep-learning 16 support-vector-machine 16 supervised-learning 16 eda 16 decision-tree-classifier 16 elbow-method 15 dendogram 14 cluster-analysis 14 k-nearest-neighbours 14 neural-network 13 support-vector-machines 12 seaborn 12 regression 12 spectral-clustering 12 clustering-analysis 12 time-series 11 rfm-analysis 11 clustering-methods 11 naive-bayes 11 apriori-algorithm 10 nlp 10 density-based-clustering 9 t-sne 9 random-forest-classifier 9 multiple-linear-regression 9 svm-classifier 9 pytorch 8 k-nearest-neighbors 8 kmeans-algorithm 8 kmeans-clustering-algorithm 8 cluster 8 elbow-plot 8 natural-language-processing 8 statistics 8 gradient-descent 7 data-preprocessing 7 gmm 7 feature-engineering 7 neural-networks 7 data-mining-algorithms 7 autoencoder 7 lda 7 segmentation 7 hdbscan 7 dbscan-clustering-algorithm 7 community-detection 7 umap 6 preprocessing 6 ward-linkage 6 knn-classifier 6 artificial-neural-networks 6 cluster-profiling 6