GitHub topics: hierarchical-clustering
DrStef/Machine_Learning_with_Python-IBM
Language: Jupyter Notebook - Size: 6.64 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

AkshaySyal/Hierarchical-Clustering
This project implements hierarchical clustering using a disjoint set data structure to iteratively merge the closest points (single linkage) on the moons dataset, then visualizes the resulting clusters when cut at K=2, K=5, and K=10
Language: Jupyter Notebook - Size: 390 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

md-experiments/picture_text
Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)
Language: Python - Size: 39.2 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 30 - Forks: 9

skfolio/skfolio
Python library for portfolio optimization built on top of scikit-learn
Language: Python - Size: 122 MB - Last synced at: 4 days ago - Pushed at: 10 days ago - Stars: 1,559 - Forks: 139

gagolews/genieclust
Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R
Language: C++ - Size: 94.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 63 - Forks: 12

rustic-ml/FormicaX
FormicaX: Rust library with clustering algorithms like K-Means, DBSCAN, and GMM, FormicaX delivers efficient, adaptable insights for trading applications. Inspired by the collaborative and resilient nature of ants (Formica), it offers a modular, high-performance framework for developers and data scientists.
Size: 102 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

panagiotisanagnostou/HiPart
Hierarchical divisive clustering algorithm execution, visualization and Interactive visualization.
Language: Python - Size: 151 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 51 - Forks: 8

Shuyib/Phylogenetic-tree-study
Estimating Phylogenetic trees using six microorganisms 16S rRNA gene with Unsupervised Learning, web based tools and Molecular Evolutionary Genetics Analysis MEGA7
Language: Jupyter Notebook - Size: 5.06 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 5

adiag321/Bank-Customer-Profiling-and-Segmentation
In this project, we will leverage AI/ML to launch a targeted marketing ad campaign that is tailored to a specific group of customers.
Language: Jupyter Notebook - Size: 26.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 1

hyuncat/RichCluster
Customizable C++ algorithm for clustering biological terms by gene similarity, compiled into an R package with supporting visualizations and for easy use.
Language: R - Size: 10.1 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 1

ivan-pi/fortran-flann
Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.
Language: Fortran - Size: 1.21 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 14 - Forks: 1

nurulashraf/hierarchical-clustering-customer-segmentation
A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.
Language: Jupyter Notebook - Size: 5.58 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

hyunsooseol/snowCluster
This module allows users to analyze k-means & hierarchical clustering, and visualize results of Principal Component, Correspondence Analysis, Discriminant analysis, Decision tree, Multidimensional scaling, Multiple Factor Analysis, Machine learning, and Prophet analysis.
Language: R - Size: 438 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 8 - Forks: 2

volfpeter/localclustering
Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.
Language: Python - Size: 2.48 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 1

pedrodbs/Aglomera
A hierarchical agglomerative clustering (HAC) library written in C#
Language: C# - Size: 2.62 MB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 17

Ali-Tharwat/Data-Science-Tasks
Comprehensive data preparation and exploration processes integrated with machine learning models for classification and clustering
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

DolapoSalim/hierarchical-clustering-and-dendrogram
This project demonstrates how to generate synthetic (marine ecological) data and apply unsupervised machine learning (hierarchical clustering) to explore patterns in policy coverage across marine zones.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

afairless/binary_classification_shap
Run histogram-based gradient boosted trees binary classifier on generated data and interpret results with standard metrics, SHAP, and supervised clustering
Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

EvanGks/hierarchical-clustering-mall-customers
A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.
Language: Jupyter Notebook - Size: 198 KB - Last synced at: 21 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

holgerteichgraeber/TimeSeriesClustering.jl
Julia implementation of unsupervised learning methods for time series datasets. It provides functionality for clustering and aggregating, detecting motifs, and quantifying similarity between time series datasets.
Language: Julia - Size: 171 MB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 82 - Forks: 23

JuliaStats/Clustering.jl
A Julia package for data clustering
Language: Julia - Size: 7.04 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 367 - Forks: 124

susanli2016/Machine-Learning-with-Python
Python code for common Machine Learning Algorithms
Language: Jupyter Notebook - Size: 58.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4,440 - Forks: 4,819

cjunwon/ODAQ-SDA
Applying Categorical Exploratory Data Analysis (CEDA) methods to study audio quality perception
Language: Python - Size: 950 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

RohanFredriksson/agglomerative-clustering
🎨 Fast hierarchical agglomerative clustering powered by WebAssembly
Language: C++ - Size: 207 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

tommygrammar/Markov-HVQ-Macro-Regime-Modeling-Pipeline
A Python toolkit for discovering and modeling macro-scale regimes in time-series data by combining Hierarchical Vector Quantisation (HVQ) with Markov chain transition modeling.
Language: Python - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

tommygrammar/stochastic-classifier
A two-stage clustering tool that converts noisy, high-entropy data into deterministic encodings for studying.
Language: Python - Size: 14.6 KB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

monty-se/PINstimation
A comprehensive bundle of utilities for the estimation of probability of informed trading models: original PIN in Easley and O'Hara (1992) and Easley et al. (1996); Multilayer PIN (MPIN) in Ersan (2016); Adjusted PIN (AdjPIN) in Duarte and Young (2009); and volume-synchronized PIN (VPIN) in Easley et al. (2011, 2012). Implementations of various estimation methods suggested in the literature are included. Additional compelling features comprise posterior probabilities, an implementation of an expectation-maximization (EM) algorithm, and PIN decomposition into layers, and into bad/good components. Versatile data simulation tools, and trade classification algorithms are among the supplementary utilities. The package provides fast, compact, and precise utilities to tackle the sophisticated, error-prone, and time-consuming estimation procedure of informed trading, and this solely using the raw trade-level data.
Language: R - Size: 5.27 MB - Last synced at: 12 days ago - Pushed at: 9 months ago - Stars: 40 - Forks: 7

BjornMelin/stardex
🌟 Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's starred repositories. Discover hidden patterns and find your next favorite project with intelligent, AI-powered exploration.
Language: TypeScript - Size: 549 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

John-sam1983/John_Ndaa_Samson_Data_Science_Portfolio
This repository is a compilation of all the data science and in particular Machine Learning projects I have successfully carried out.
Language: Jupyter Notebook - Size: 8.24 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

marcocampanario/ms-mlpa_HCA-ROC
Bioinformatics consultancy on MS-MLPA data analysis | Consultoria em Bioinformática para análise de dados de MS-MLPA
Language: R - Size: 394 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

lumi-a/exact-clustering
Find optimal clusterings and optimal hierarchical clusterings.
Language: Rust - Size: 87.9 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sung-yeon-kim/HIER-CVPR23
Official PyTorch Implementation of HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization, CVPR 2023
Language: Python - Size: 4.52 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 59 - Forks: 6

OmarA32/House-Price-Prediction-and-Clustering-with-Deep-Learning
[WIP] House Price Prediction & Clustering. Utilizing PyTorch for neural networks, and Tkinter for the user interface.
Language: Python - Size: 932 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nadeyyah/Clustering-the-Quality-of-Senior-High-School-Education-in-Each-District-of-Indonesia-in-2023
The clustering method can help to find out which provinces need special attention in overcoming education problems
Language: R - Size: 412 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

pajaskowiak/clusterConfusion
Clustering validation with ROC Curves
Language: R - Size: 1.2 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 1

dimitris-markopoulos/latent-semantic-clustering
Clustering book chapters with unsupervised ML—custom EM-GMM, sklearn baselines, and dimensionality reduction.
Language: Jupyter Notebook - Size: 87.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

socnetv/app
Social Network Analysis and Visualization software application.
Language: C++ - Size: 22.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 227 - Forks: 27

creme332/rowmerge
A heuristic algorithm for merging rows efficiently.
Language: C++ - Size: 7.18 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

pingyuu/student_performance_clustering_r
PCA-based clustering of student grades to explore academic performance patterns (R)
Language: R - Size: 146 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

aryddntaabbss/klasterisasi-minatbaca
Proyek ini merupakan hasil dari SKRIPSI saya yang berjudul "KLASTERISASI MINAT BACA MASYARAKAT KOTA TERNATE MENGGUNAKAN ALGORITMA HIERARCHICAL CLUSTERING (STUDI KASUS DINAS PERPUSTAKAAN DAN KEARSIPAN DAERAH KOTA TERNATE)".
Language: Python - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dedupeio/hcluster 📦
Hierarchical Clustering Algorithms
Language: Python - Size: 1.75 MB - Last synced at: about 23 hours ago - Pushed at: about 3 years ago - Stars: 36 - Forks: 20

Markkreel/Binary-Static-Analysis-Through-Instruction-and-Operand-Extraction-and-AHC-Algorithm
A static binary analysis tool visualizes code blocks in the assembly of a disassembled binary file using the AHC algorithm, aided by entropy calculation and similarity measurement.
Language: Assembly - Size: 3.93 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

gcorso/NeuroSEED
Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch (NeurIPS 2021)
Language: Python - Size: 1.38 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 18

DiogoFerrari/hdpGLM
Hierarchical Dirichlet Process Generalized Linear Models
Language: R - Size: 42.4 MB - Last synced at: 20 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 4

aungpyaeap/distfun-matlab
MATLAB functions designed to construct dissimilarity matrices using a variety of distance metric functions. It provides a comprehensive toolkit for analyzing and comparing data sets through different distance measures.
Language: MATLAB - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

koonimaru/radialtree
A python module to draw a circular dendrogram
Language: Python - Size: 1.35 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 19 - Forks: 9

lamtong/car_price_analysis
This project performs exploratory data analysis on the CW car price dataset, applies machine learning models (Linear Regression, Neural Networks) for price prediction, and uses unsupervised learning techniques for product segmentation.
Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

MengChunYou/hstoptics
Language: R - Size: 85.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

shallowManica/Curriculum-Design-through-Web-Scraping
This repository designs a course curriculum for a Master’s program in DS & AI. The project involves web-scraping job postings from Indeed.com, extracting and engineering skill-based features using NLP and OpenAI’s text embeddings, and applying both hierarchical and k-means clustering algorithms.
Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

xhan97/hunger
A python library for evaluating Hierarchical Clustering
Language: Python - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

mishraanuraagx/FinSight
FinSight is a machine learning-driven financial analytics tool designed to explore, cluster, and visualize different financial assets based on their risk and return behaviors.
Language: Jupyter Notebook - Size: 379 KB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nlp4se/FeaClustRE
API for feature clustering, generating hierarchical feature organization with feature family clustering.
Language: Python - Size: 288 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

rajarsheya/Scalable-Recognition-using-Vocabulary-Tree
Enhanced Vocabulary Trees for Real-Time Object Recognition in Image and Video Streams
Language: C++ - Size: 896 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

PZVivian/School.MSc.STATS780_DataScience
This repository contains projects for the STATS 780 Data Science course at McMaster University completed during my master's studies.
Language: Jupyter Notebook - Size: 39.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

havelhakimi/gene-expression
Agglomerative based clustering on gene expression dataset
Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

razamehar/Financial-Stock-Analysis-and-Clustering
Analyzed 157 US Energy stocks (Jan-Dec '23), identified Bullish/Bearish trends and risk categories. Used KMeans, Hierarchical, Spectral Clustering, revealing balanced returns and low volatility. Integrated data with Kafka for seamless subscriptions.
Language: Jupyter Notebook - Size: 4.34 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 2

greenelab/hclust
Agglomerative hierarchical clustering in JavaScript
Language: JavaScript - Size: 202 KB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 19 - Forks: 3

arj1211/cluster-links
pipeline that extracts, cleans, embeds, and clusters web links into topical groups using text extraction, semantic keyword extraction, and unsupervised clustering
Language: Python - Size: 34.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ayushi-p/webscrapping-indeed
This project automates the extraction of job postings from Indeed using web scraping techniques. It gathers structured job data, including job titles, company names, locations, salaries, and job descriptions, to provide insights into hiring trends, salary benchmarks, and skill demand across industries.
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

geoav74/Data_Scientist_Salaries_in_EUR_2025
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

PacktWorkshops/The-Unsupervised-Learning-Workshop
An Interactive Approach to Understanding Unsupervised Learning Algorithms
Language: Jupyter Notebook - Size: 115 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 28 - Forks: 32

AniK4111/Netflix_Movies_And_TV_Shows_Clustering
Unsupervised Machine Learning project for Netflix Movies and TV Shows Clustering. The main goal of this project is to create a content-based recommender system that recommends top 10 shows to users based on their viewing history.
Size: 2.58 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

pamudu123/seeds_clustering
Machine Learning Clustering
Language: Jupyter Notebook - Size: 1.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

AdityaSreevatsaK/DS-ML-Playground
A collection of data science and machine learning projects showcasing complete workflows, from data cleaning and preprocessing to model building and evaluation. Dive into diverse datasets, explore a range of techniques, and experiment with models in this comprehensive playground for learning and innovation.
Language: Jupyter Notebook - Size: 40.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

beginnerSC/pyminimax
Python implementation of minimax-linkage hierarchical clustering
Language: Python - Size: 444 KB - Last synced at: about 8 hours ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

taneishi/LeukemiaClustering
Hierarchical Clustering of Leukemia Gene Expression Dataset
Language: Python - Size: 106 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

pngo1997/Chicago-Airbnb-Hybrid-Recommender-System
Develops a hybrid recommender system for Chicago Airbnb listings using data from Inside Airbnb.
Language: Jupyter Notebook - Size: 30.9 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

gabrielramirezv/protein_clustering
The ABC transporter family is a group of proteins that are involved in the transport of various molecules across the cell membrane. We aim to to cluster the proteins of the ABC transporter family into groups based on their sequence similarity.
Language: R - Size: 3.19 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

Tolumie/Stock-Market-Clustering-and-Predictive-Analysis_-Uncovering-Investment-Insights
Stock Market Clustering & Predictive Analysis | Leverage PCA & DBSCAN, K-Means, Hierarchical Clustering to uncover investment insights. Identify market segments, high-risk outliers (NVDA, TSLA, NFLX), and portfolio optimization strategies using S&P 500 data.
Language: Jupyter Notebook - Size: 236 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

aye-nyeinSan/NLP_Workshop5
The national Anthems of our world with K-Mean and HAC
Language: Jupyter Notebook - Size: 220 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bgr8/Makine-ogrenmesi
Machine Learning with Python
Language: Jupyter Notebook - Size: 3.03 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 1

firatolcum/Machine_Learning_Course
This repository contains the Machine Learning lessons I took from the Clarusway Bootcamp between 10 Aug - 14 Sep 2022 and includes 17 sessions, 5 labs, 4 case studies, 5 weekly agendas, and 3 projects.
Language: Jupyter Notebook - Size: 237 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 1

drod-96/efficient_clustering
Efficient clustering approch to identify optimal heat consumers clusters within the DHNs
Language: Python - Size: 11.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shubhamjha97/hierarchical-clustering
A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Language: Python - Size: 2.28 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 83 - Forks: 31

data-liangai/hierarchical-clustering
This is a hierarchical clustering project of viral protein sequences. For details, please refer to https://doi.org/10.46793/match.90-2.381A
Language: R - Size: 344 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Vidhi1290/Malware-Detection
Welcome to the Malicious Executable Detection project! This repository explores the world of machine learning and clustering analysis to detect malicious executable files 🔥🔐
Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

nmonath/graphgrove
A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search
Language: C++ - Size: 1.03 MB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 44 - Forks: 6

AiCorsair/Python-Case-Study-365-Data-Science-Customer-Segmentation-in-Marketing
This repository contains a detailed case study on the segmentation of 365 Data Science customers using real-world data from an onboarding survey.
Language: Jupyter Notebook - Size: 425 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fazeelibtesam/Brain_Tumor
Classification of Brain Tumor
Language: Jupyter Notebook - Size: 577 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

dpruthardt/UnsupervisedLearning
Language: HTML - Size: 3.46 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

iesl/xcluster
Algorithms and evaluation tools for extreme clustering
Language: Scala - Size: 8.95 MB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 72 - Forks: 21

raj1603chdry/CSE3020-Web-Mining-Labs
Repository containing all the codes created for the lab sessions of CSE3020 Web Mining at VIT University Chennai Campus
Language: Python - Size: 13.8 MB - Last synced at: 21 days ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 19

wei9935/wasserstein_HRP
This trial explores enhancing HRP portfolio optimization by using sliced-Wasserstein distance for clustering. Results are preliminary and may not outperform the original method, but further research could improve outcomes.
Language: Python - Size: 1.34 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

QuantLet/HClustering Fork of lizzzi111/HClustering
Language: Jupyter Notebook - Size: 30.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 3

cmdevries/LMW-tree
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering
Language: C++ - Size: 74.5 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 74 - Forks: 20

AbbasPak/Clustering-Methods-Examples
This repository aims to provide an overview of various clustering methods, along with practical examples and implementations.
Language: Jupyter Notebook - Size: 1.59 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

wolny/phash-hierarchical-clustering
Hierarchical clustering of images using phash and Hamming distance
Language: Scala - Size: 1.37 MB - Last synced at: 2 days ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 3

OCHOLA-EDDYPHIL/Clustering
This project performs hierarchical clustering on a dataset containing network usage and performance metrics. It includes data preprocessing, encoding, normalization, and visualization of clustering results using dendrograms. The purpose is to analyze and group similar data points, offering insights into patterns and relationships within the dataset
Language: Python - Size: 19.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

weihan-zhao/knotweed_fieldwork
The documents under this repository are the codes for analysis of knotweed population performance across ranges and environmental drivers of trait variation
Language: R - Size: 82 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jonghough/jlearn
Machine Learning Library, written in J
Language: J - Size: 9.53 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 13

Armin-Abdollahi/Machine-Learning
Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

mariamffatima/Machine-Learning-Tasks
This repository contains a collection of lab tasks, assignments, and projects designed to learn and practice key concepts in Machine Learning. It includes hands-on Jupyter notebooks covering fundamental ML techniques, real-world projects, and theoretical exercises. Ideal for students and enthusiasts aiming to deepen their understanding of ML.
Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

teja-1403/Coursera-Machine-Learning-with-Python-Honors
This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.
Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

moon-hotel/MachineLearningWithMe
A repository contains more than 12 common statistical machine learning algorithm implementations. 常见10余种机器学习算法原理与实现及视频讲解。
Language: Jupyter Notebook - Size: 35.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 254 - Forks: 47

theo-liang/Python-Project-Analysis-for-ClimateWins
This project analyzes historical weather data to identify patterns and predict future weather conditions, focusing on extreme events and temperature trends across Europe.
Language: Jupyter Notebook - Size: 76.1 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

mrunmaim16/CSE-5334-Programming-Assignments
Programming assignments completed for course CSE - 5334 Data Mining under Professor Dr. Marnim Galib.
Language: Jupyter Notebook - Size: 460 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

iesl/expLinkage
Supervised hierarchical clustering
Language: Python - Size: 102 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 6

felipeversiane/face-cluster
application that receives a dataset of faces and creates a cluster of images that have similarity.
Language: Python - Size: 555 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

subhashpolisetti/Clustering-Techniques-and-Embeddings
This repository includes Colab notebooks demonstrating various clustering algorithms, from scratch-based methods to advanced deep learning models and embeddings. Each notebook features explanations, visualizations, and quality evaluation metrics for clustering performance.
Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

palVikram/Machine-Learning-using-Python
Regression, Classification and Clustering
Language: Python - Size: 223 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 19 - Forks: 19
