An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hierarchical-clustering

acsicuib/GA-hierarchical-clustering

Algorithm to analyse fog colonies and service placement using genetic algorithms and hierarchical clustering

Language: Python - Size: 31.9 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

hyunsooseol/snowCluster

This module allows users to analyze k-means & hierarchical clustering, and visualize results of Principal Component, Correspondence Analysis, Discriminant analysis, Decision tree, Multidimensional scaling, Multiple Factor Analysis, Machine learning, and Prophet analysis.

Language: R - Size: 440 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 9 - Forks: 2

moon-hotel/MachineLearningWithMe

A repository contains more than 12 common statistical machine learning algorithm implementations. 常见10余种机器学习算法原理与实现及视频讲解。@月来客栈 出品

Language: Jupyter Notebook - Size: 36 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 279 - Forks: 50

gagolews/genieclust

Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R

Language: C++ - Size: 108 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 67 - Forks: 12

skfolio/skfolio

Python library for portfolio optimization built on top of scikit-learn

Language: Python - Size: 123 MB - Last synced at: 6 days ago - Pushed at: 8 days ago - Stars: 1,707 - Forks: 157

dcelisgarza/PortfolioOptimisers.jl

Portfolio Optimisation library built in Julia.

Language: Julia - Size: 53.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 0

rustic-ml/FormicaX

FormicaX: Rust library with clustering algorithms like K-Means, DBSCAN, and GMM, FormicaX delivers efficient, adaptable insights for trading applications. Inspired by the collaborative and resilient nature of ants (Formica), it offers a modular, high-performance framework for developers and data scientists.

Language: Rust - Size: 282 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

aungpyaeap/Weighted-Mixed-Distance

A Distance Metric for Clustering Mixed Data Using Graph-Based Feature Influence Balancing Approach.

Language: MATLAB - Size: 3.68 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

QuaCau-TheSphere/Graphvidian

Obsidian plugin to export Graphviz graphs from vault's notes

Language: TypeScript - Size: 99.6 KB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 1

JessicaWoods03/Furby_Hack

Semantics and Ontology Large Language AI Model

Language: Jupyter Notebook - Size: 13.1 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 5 - Forks: 0

smart-models/Progressive-Summarizer-RAPTOR

Cutting-edge semantic text processing system that uses hierarchical clustering and advanced language models to automatically organize and summarize large volumes of text.

Language: Python - Size: 2.12 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

pneuvial/adjclust

Adjacency-constrained hierarchical clustering of a similarity matrix

Language: R - Size: 18.5 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 16 - Forks: 9

Otniel113/SegmentasiPelanggan

Penerapan Kecerdasan Buatan dalam Komunikasi Pemasaran: Segmentasi Pelanggan dengan Agglomerative Clustering

Language: Jupyter Notebook - Size: 1.63 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

JuliaStats/Clustering.jl

A Julia package for data clustering

Language: Julia - Size: 7.04 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 367 - Forks: 124

sahilkarande/Machine-Learning-Algorithms-Course

Machine Learning Mastery is a comprehensive repository designed to teach machine learning with Python. It covers essential techniques from data preprocessing to advanced methods in classification, regression, and clustering, catering to beginners and advanced learners alike.

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 1

ahmedshahriar/PulsePoint-Data-Analytics

EDA, data processing, cleaning and extensive geospatial analysis on a selenium based web crawled dataset

Language: HTML - Size: 10.1 MB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 2

tasrif-khondaker/HT-CapsNet

Taxonomy-Guided Routing in Capsule Network for Hierarchical Image Classification

Language: Python - Size: 1.21 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

PaPJoIN/unsupervised-learning-customer-segmentation

Performing customer segmentation by utilising unsupervised learning methods - Hierarchical & K-means clustering, with further dimensionality reduction for clearer modelling and visualisation.

Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

aungpyaeap/distfun-matlab

MATLAB functions designed to construct dissimilarity matrices using a variety of distance metric functions. It provides a comprehensive toolkit for analyzing and comparing data sets through different distance measures.

Language: MATLAB - Size: 35.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

houhou21/dry_eye_disease-cluster-analysis

👁️ Analyze clustering methods to uncover subgroups in Dry Eye Disease patients, using health and lifestyle data for targeted insights and improved outcomes.

Size: 3.45 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

holgerteichgraeber/TimeSeriesClustering.jl

Julia implementation of unsupervised learning methods for time series datasets. It provides functionality for clustering and aggregating, detecting motifs, and quantifying similarity between time series datasets.

Language: Julia - Size: 171 MB - Last synced at: about 14 hours ago - Pushed at: almost 5 years ago - Stars: 84 - Forks: 23

PacktWorkshops/The-Unsupervised-Learning-Workshop

An Interactive Approach to Understanding Unsupervised Learning Algorithms

Language: Jupyter Notebook - Size: 114 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 29 - Forks: 33

lumi-a/exact-clustering

Find optimal clusterings and optimal hierarchical clusterings.

Language: Rust - Size: 107 KB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

panagiotisanagnostou/HiPart

Hierarchical divisive clustering algorithm execution, visualization and Interactive visualization.

Language: Python - Size: 151 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 52 - Forks: 8

javiermerinom/dry_eye_disease-cluster-analysis

Unsupervised ML analysis of lifestyle data to uncover risk patterns for Dry Eye Disease

Size: 3.45 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

kumar-kiran-24/Mall-Customer-Segmentation

A project to segment customers by comparing clustering results with Rand Index, analyzing K-Means clusters, and interpreting segments with meaningful business insights.

Language: Jupyter Notebook - Size: 1020 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

DeboJp/Clustering-Countries-on-Socioeconomic-Indicators

Hierarchical agglomerative clustering (HAC) based on socioeconomic indicators of countries.

Language: Python - Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

jonnevd/constrained-linkage

Python implementation of a (plug-and-play) constrained linkage function for constrained hierarchical clustering with maximum cluster size, minimum cluster size, must-link, cannot-link and custom constraints, returns SciPy-compatible linkage matrix for subsequent Hierarchical Agglomerative Clustering. Based on HEAT published in Energy and AI (2025)

Language: Python - Size: 219 KB - Last synced at: 27 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mailtopurnimams-29/Project_UnsupervisedLearning

Project Goal: To segment credit card customers based on spending patterns and interactions for targeted marketing. Role: Conducted EDA, applied clustering algorithms, reduced dimensions, and profiled segments. Helped identify key customer segments, enabling the bank to tailor services and improve customer engagement

Language: Jupyter Notebook - Size: 2.23 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

hyuncat/RichCluster

Customizable C++ algorithm for clustering biological terms by gene similarity, compiled into an R package with supporting visualizations and for easy use.

Language: R - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 1

EhtishamSK/HierCluster

hierarchical clustering and circular dendrograms

Language: R - Size: 620 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

alagoz/higec

HiGeC: Hierarchy Generation and Classification Framework

Language: Python - Size: 2.22 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

caponetto/bayesian-hierarchical-clustering

Python implementation of Bayesian hierarchical clustering and Bayesian rose trees algorithms.

Language: Python - Size: 65.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

Mo-Elshamy/machine-learning-practice

This repository serves as a collection of my work and learning in machine learning while my internship in Cellual-Technologies, including algorithm explanations, data preprocessing workflows, and two projects.

Language: HTML - Size: 24.8 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

david-revell/customer-segmentation-clustering

Customer segmentation using K-Means and Hierarchical clustering on e-commerce data

Language: Jupyter Notebook - Size: 2.87 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

adiag321/Bank-Customer-Profiling-and-Segmentation

In this project, we will leverage AI/ML to launch a targeted marketing ad campaign that is tailored to a specific group of customers.

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

nmonath/graphgrove

A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search

Language: C++ - Size: 1.03 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 7

fakework16/Car_price_prediction_ML

Predict car prices with our Machine Learning model. Input features like brand and year for accurate predictions. Explore the project on GitHub! 🐙

Language: Jupyter Notebook - Size: 608 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

guglielmosanchini/ClustViz

Visualization of many Clustering Algorithms, via Notebook or GUI

Language: Jupyter Notebook - Size: 246 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 24 - Forks: 14

MAbdelhamid2001/Advanced-Unsupervised-Learning

Applying many advanced unsupervised learning algorithms and techniques

Language: Jupyter Notebook - Size: 7.39 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

ivan-pi/fortran-flann

Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.

Language: Fortran - Size: 1.21 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 15 - Forks: 1

SaiPh3r/hierarchical_clustering

A very basic project that gives an ideas behind the thoery of hierarchy clustering model (ps-i love exploring actual theory and mathematical induction behind ML models)

Language: Jupyter Notebook - Size: 378 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Ajeeb-Alameen/machine-learning

This repository contains machine learning projects from the Fundamentals of Machine Learning course at GUC.

Language: Jupyter Notebook - Size: 4.37 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

KhooodeSIN/Movie-clustering

Language: Jupyter Notebook - Size: 108 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DrStef/Machine_Learning_with_Python-IBM

Language: Jupyter Notebook - Size: 6.64 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

AkshaySyal/Hierarchical-Clustering

This project implements hierarchical clustering using a disjoint set data structure to iteratively merge the closest points (single linkage) on the moons dataset, then visualizes the resulting clusters when cut at K=2, K=5, and K=10

Language: Jupyter Notebook - Size: 390 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

at-tan/Hierarchical_Clustering_of_Currencies 📦

A clustering exercise of global currencies on three common financial market features using data from 2017 through 2019, as published in Towards Data Science on Medium.com

Language: Jupyter Notebook - Size: 6 MB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 3

md-experiments/picture_text

Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)

Language: Python - Size: 39.2 MB - Last synced at: 21 days ago - Pushed at: 10 months ago - Stars: 30 - Forks: 9

Shuyib/Phylogenetic-tree-study

Estimating Phylogenetic trees using six microorganisms 16S rRNA gene with Unsupervised Learning, web based tools and Molecular Evolutionary Genetics Analysis MEGA7

Language: Jupyter Notebook - Size: 5.06 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 5

nicolasfguillaume/Strategic-Business-Analytics-with-R

Foundations of Strategic Business Analytics - ESSEC Business school via Coursera.org

Size: 276 KB - Last synced at: 3 months ago - Pushed at: over 9 years ago - Stars: 18 - Forks: 27

nurulashraf/hierarchical-clustering-customer-segmentation

A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.

Language: Jupyter Notebook - Size: 5.58 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

volfpeter/localclustering

Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.

Language: Python - Size: 2.48 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 1

pedrodbs/Aglomera

A hierarchical agglomerative clustering (HAC) library written in C#

Language: C# - Size: 2.62 MB - Last synced at: 18 days ago - Pushed at: almost 3 years ago - Stars: 52 - Forks: 17

Ali-Tharwat/Data-Science-Tasks

Comprehensive data preparation and exploration processes integrated with machine learning models for classification and clustering

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

DolapoSalim/hierarchical-clustering-and-dendrogram

This project demonstrates how to generate synthetic (marine ecological) data and apply unsupervised machine learning (hierarchical clustering) to explore patterns in policy coverage across marine zones.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

afairless/binary_classification_shap

Run histogram-based gradient boosted trees binary classifier on generated data and interpret results with standard metrics, SHAP, and supervised clustering

Language: Python - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

EvanGks/hierarchical-clustering-mall-customers

A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.

Language: Jupyter Notebook - Size: 198 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

susanli2016/Machine-Learning-with-Python

Python code for common Machine Learning Algorithms

Language: Jupyter Notebook - Size: 58.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4,440 - Forks: 4,819

cjunwon/ODAQ-SDA

Applying Categorical Exploratory Data Analysis (CEDA) methods to study audio quality perception

Language: Python - Size: 950 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

RohanFredriksson/agglomerative-clustering

🎨 Fast hierarchical agglomerative clustering powered by WebAssembly

Language: C++ - Size: 207 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

kamlesh0928/machine-learning

This repository contains machine learning algorithms implemented from scratch and using scikit-learn, covering classification, regression, and clustering. Each algorithm is well-documented, with clear code and explanations. To use K-Medoids, install sklearn_extra via pip install scikit-learn-extra. Contributions are welcome!

Language: Python - Size: 199 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

tommygrammar/Markov-HVQ-Macro-Regime-Modeling-Pipeline

A Python toolkit for discovering and modeling macro-scale regimes in time-series data by combining Hierarchical Vector Quantisation (HVQ) with Markov chain transition modeling.

Language: Python - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

tommygrammar/stochastic-classifier

A two-stage clustering tool that converts noisy, high-entropy data into deterministic encodings for studying.

Language: Python - Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

enricivi/growing_hierarchical_som

Self-Organizing Map [https://en.wikipedia.org/wiki/Self-organizing_map] is a popular method to perform cluster analysis. SOM shows two main limitations: fixed map size constraints how the data is being mapped and hierarchical relationships are not easily recognizable. Thus Growing Hierarchical SOM has been designed to overcome this issues

Language: Python - Size: 2.21 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 47 - Forks: 11

monty-se/PINstimation

A comprehensive bundle of utilities for the estimation of probability of informed trading models: original PIN in Easley and O'Hara (1992) and Easley et al. (1996); Multilayer PIN (MPIN) in Ersan (2016); Adjusted PIN (AdjPIN) in Duarte and Young (2009); and volume-synchronized PIN (VPIN) in Easley et al. (2011, 2012). Implementations of various estimation methods suggested in the literature are included. Additional compelling features comprise posterior probabilities, an implementation of an expectation-maximization (EM) algorithm, and PIN decomposition into layers, and into bad/good components. Versatile data simulation tools, and trade classification algorithms are among the supplementary utilities. The package provides fast, compact, and precise utilities to tackle the sophisticated, error-prone, and time-consuming estimation procedure of informed trading, and this solely using the raw trade-level data.

Language: R - Size: 5.27 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 40 - Forks: 7

BjornMelin/stardex

🌟 Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's starred repositories. Discover hidden patterns and find your next favorite project with intelligent, AI-powered exploration.

Language: TypeScript - Size: 549 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

John-sam1983/John_Ndaa_Samson_Data_Science_Portfolio

This repository is a compilation of all the data science and in particular Machine Learning projects I have successfully carried out.

Language: Jupyter Notebook - Size: 8.24 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

marcocampanario/ms-mlpa_HCA-ROC

Bioinformatics consultancy on MS-MLPA data analysis | Consultoria em Bioinformática para análise de dados de MS-MLPA

Language: R - Size: 394 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

392781/Country-Clustering-Analysis

Cluster analysis practice done on a dataset of 167 countries

Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sung-yeon-kim/HIER-CVPR23

Official PyTorch Implementation of HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization, CVPR 2023

Language: Python - Size: 4.52 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 59 - Forks: 6

OmarA32/House-Price-Prediction-and-Clustering-with-Deep-Learning

[WIP] House Price Prediction & Clustering. Utilizing PyTorch for neural networks, and Tkinter for the user interface.

Language: Python - Size: 932 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

nadeyyah/Clustering-the-Quality-of-Senior-High-School-Education-in-Each-District-of-Indonesia-in-2023

The clustering method can help to find out which provinces need special attention in overcoming education problems

Language: R - Size: 412 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pajaskowiak/clusterConfusion

Clustering validation with ROC Curves

Language: R - Size: 1.2 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 7 - Forks: 1

dimitris-markopoulos/latent-semantic-clustering

Clustering book chapters with unsupervised ML—custom EM-GMM, sklearn baselines, and dimensionality reduction.

Language: Jupyter Notebook - Size: 87.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1

socnetv/app

Social Network Analysis and Visualization software application.

Language: C++ - Size: 22.9 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 227 - Forks: 27

creme332/rowmerge

A heuristic algorithm for merging rows efficiently.

Language: C++ - Size: 7.18 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

pingyuu/student_performance_clustering_r

PCA-based clustering of student grades to explore academic performance patterns (R)

Language: R - Size: 146 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

aryddntaabbss/klasterisasi-minatbaca

Proyek ini merupakan hasil dari SKRIPSI saya yang berjudul "KLASTERISASI MINAT BACA MASYARAKAT KOTA TERNATE MENGGUNAKAN ALGORITMA HIERARCHICAL CLUSTERING (STUDI KASUS DINAS PERPUSTAKAAN DAN KEARSIPAN DAERAH KOTA TERNATE)".

Language: Python - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

dedupeio/hcluster 📦

Hierarchical Clustering Algorithms

Language: Python - Size: 1.75 MB - Last synced at: about 7 hours ago - Pushed at: over 3 years ago - Stars: 36 - Forks: 20

Markkreel/Binary-Static-Analysis-Through-Instruction-and-Operand-Extraction-and-AHC-Algorithm

A static binary analysis tool visualizes code blocks in the assembly of a disassembled binary file using the AHC algorithm, aided by entropy calculation and similarity measurement.

Language: Assembly - Size: 3.93 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

gcorso/NeuroSEED

Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch (NeurIPS 2021)

Language: Python - Size: 1.38 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 73 - Forks: 18

DiogoFerrari/hdpGLM

Hierarchical Dirichlet Process Generalized Linear Models

Language: R - Size: 42.4 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 4

koonimaru/radialtree

A python module to draw a circular dendrogram

Language: Python - Size: 1.35 MB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 9

lamtong/car_price_analysis

This project performs exploratory data analysis on the CW car price dataset, applies machine learning models (Linear Regression, Neural Networks) for price prediction, and uses unsupervised learning techniques for product segmentation.

Language: Jupyter Notebook - Size: 1000 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

MengChunYou/hstoptics

Language: R - Size: 85.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

shallowManica/Curriculum-Design-through-Web-Scraping

This repository designs a course curriculum for a Master’s program in DS & AI. The project involves web-scraping job postings from Indeed.com, extracting and engineering skill-based features using NLP and OpenAI’s text embeddings, and applying both hierarchical and k-means clustering algorithms.

Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

xhan97/hunger

A python library for evaluating Hierarchical Clustering

Language: Python - Size: 13.7 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

mishraanuraagx/FinSight

FinSight is a machine learning-driven financial analytics tool designed to explore, cluster, and visualize different financial assets based on their risk and return behaviors.

Language: Jupyter Notebook - Size: 379 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nlp4se/FeaClustRE

API for feature clustering, generating hierarchical feature organization with feature family clustering.

Language: Python - Size: 288 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

rajarsheya/Scalable-Recognition-using-Vocabulary-Tree

Enhanced Vocabulary Trees for Real-Time Object Recognition in Image and Video Streams

Language: C++ - Size: 896 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 1

PZVivian/School.MSc.STATS780_DataScience

This repository contains projects for the STATS 780 Data Science course at McMaster University completed during my master's studies.

Language: Jupyter Notebook - Size: 39.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

havelhakimi/gene-expression

Agglomerative based clustering on gene expression dataset

Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0

razamehar/Financial-Stock-Analysis-and-Clustering

Analyzed 157 US Energy stocks (Jan-Dec '23), identified Bullish/Bearish trends and risk categories. Used KMeans, Hierarchical, Spectral Clustering, revealing balanced returns and low volatility. Integrated data with Kafka for seamless subscriptions.

Language: Jupyter Notebook - Size: 4.34 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 2

greenelab/hclust

Agglomerative hierarchical clustering in JavaScript

Language: JavaScript - Size: 202 KB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 19 - Forks: 3

arj1211/cluster-links

pipeline that extracts, cleans, embeds, and clusters web links into topical groups using text extraction, semantic keyword extraction, and unsupervised clustering

Language: Python - Size: 34.2 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ayushi-p/webscrapping-indeed

This project automates the extraction of job postings from Indeed using web scraping techniques. It gathers structured job data, including job titles, company names, locations, salaries, and job descriptions, to provide insights into hiring trends, salary benchmarks, and skill demand across industries.

Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

geoav74/Data_Scientist_Salaries_in_EUR_2025

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

AniK4111/Netflix_Movies_And_TV_Shows_Clustering

Unsupervised Machine Learning project for Netflix Movies and TV Shows Clustering. The main goal of this project is to create a content-based recommender system that recommends top 10 shows to users based on their viewing history.

Size: 2.58 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

pamudu123/seeds_clustering

Machine Learning Clustering

Language: Jupyter Notebook - Size: 1.9 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

AdityaSreevatsaK/DS-ML-Playground

A collection of data science and machine learning projects showcasing complete workflows, from data cleaning and preprocessing to model building and evaluation. Dive into diverse datasets, explore a range of techniques, and experiment with models in this comprehensive playground for learning and innovation.

Language: Jupyter Notebook - Size: 40.2 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Related Keywords
hierarchical-clustering 644 clustering 209 kmeans-clustering 164 machine-learning 148 python 127 k-means-clustering 116 unsupervised-learning 72 data-science 70 pca 66 dbscan-clustering 60 clustering-algorithm 59 logistic-regression 52 agglomerative-clustering 42 linear-regression 42 r 41 k-means 41 data-visualization 36 unsupervised-machine-learning 36 dbscan 35 dendrogram 35 decision-trees 34 machine-learning-algorithms 33 kmeans 33 random-forest 31 principal-component-analysis 30 pca-analysis 28 data-analysis 27 pandas 27 visualization 27 python3 26 gaussian-mixture-models 26 scikit-learn 26 classification 26 exploratory-data-analysis 24 jupyter-notebook 23 data-mining 23 knn-classification 23 silhouette-score 22 knn 21 numpy 20 dimensionality-reduction 18 polynomial-regression 18 sklearn 18 matplotlib 18 customer-segmentation 17 svm 16 support-vector-machine 16 supervised-learning 16 decision-tree-classifier 16 deep-learning 16 naive-bayes-classifier 16 scipy 16 elbow-method 15 eda 15 k-nearest-neighbours 14 cluster-analysis 14 dendogram 14 neural-network 13 regression 12 seaborn 12 support-vector-machines 12 spectral-clustering 11 clustering-analysis 11 rfm-analysis 11 clustering-methods 11 time-series 11 naive-bayes 11 t-sne 9 nlp 9 multiple-linear-regression 9 density-based-clustering 9 svm-classifier 9 apriori-algorithm 9 random-forest-classifier 9 pytorch 8 kmeans-clustering-algorithm 8 natural-language-processing 8 k-nearest-neighbors 8 statistics 8 elbow-plot 8 cluster 8 kmeans-algorithm 8 autoencoder 7 community-detection 7 data-preprocessing 7 neural-networks 7 lda 7 gradient-descent 7 segmentation 7 feature-engineering 7 data-mining-algorithms 7 dbscan-clustering-algorithm 7 gmm 7 feature-selection 6 knn-classifier 6 portfolio-optimization 6 optimization 6 support-vector-regression 6 nlp-machine-learning 6 artificial-intelligence 6