An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hdbscan-clustering-algorithm

gagolews/quitefastmst

quitefastmst: Euclidean and Mutual Reachability Minimum Spanning Trees

Language: C++ - Size: 25.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

HostServer001/jee_mains_pyqs_data_base

Tool to access and manage 14k+ jee main pyqs with semantic clustering

Language: Python - Size: 64.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

MartinTschechne/ASL-hdbscan

An optimized C/C++ implementation of the HDBSCAN algorithm for the course Advanced Systems Lab.

Language: C++ - Size: 23.7 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

mbari-org/sdcat

Sliced Detection and Clustering Analysis Toolkit

Language: Python - Size: 33.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

DanteTrb/ParkinsonBiomech_XAI_GAN

Discovering biomechanical phenotypes in Parkinson’s disease using GANs and explainable AI

Language: Jupyter Notebook - Size: 3.74 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

bakraamm/ML-for-Workforce-Analytics-Sales-Forecasting-Segmentation-Sentiment-Analysis

This repository explores machine learning applications in HR, Sales, Marketing, and PR to improve decision-making. It features models for predicting employee turnover, forecasting sales trends, and customer segmentation. 🐙📊

Language: Jupyter Notebook - Size: 4.83 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

MadhukarSaiBabu/ML-for-Workforce-Analytics-Sales-Forecasting-Segmentation-Sentiment-Analysis

Implemented machine learning across HR, Sales, Marketing, and PR to improve decision-making. Used models like XGBoost, Prophet, LSTM, clustering, and NLP to enhance retention, forecasting, segmentation, and sentiment analysis for business growth.

Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

aws-samples/amazon-sagemaker-local-mode

Amazon SageMaker Local Mode Examples

Language: Python - Size: 5.94 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 257 - Forks: 63

aidendorian/Spotify-Song-Recommendation

Recommends songs from dataset of 232K songs from Spotify. Uses HDBSCAN and Siamese Network. An ML Project

Language: Python - Size: 0 Bytes - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

marthadais/TrajectoriesCompressionAnalysis

The proposed methodology assess how compression algorithms influence the clustering analysis with respect to anomaly detection of vessel trajectories.

Language: Python - Size: 4.69 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 2

NehaPant14/Density-based-clustering

Density based clustering

Language: Jupyter Notebook - Size: 459 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

brijeshhere/Unsupervised-Learning-Cluster-Streamlit-Dashboard

By utilizing various features, the Project assists the organization in categorizing countries and determining which country the NGO should allocate funds to.

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

edo-pasto/Parallel-Flexible-Clustering

The thesis presents the parallelisation of a state-of-the art clustering algorithm, FISHDBC. This objective has been achived by improving the main data structures and components of the algorithm: HNSW, MST and HDBSCAN. My contribution is based on a lock-free strategy, completely wrote in Python.

Language: Python - Size: 5.81 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

kedir/GLG--Topic-Modeling-and-Document-Clustering

Cluster documents and extract global and local topics per cluster using LDA (Latent Dirichlet Allocation) algorithm

Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 1

Face-Tagger/facetagger-lib

Python library designed to classify photos containing specific individuals from a collection of images.

Language: Python - Size: 2.22 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

felipetobars/Clustering_Jupyter

Implementación de algoritmos de aprendizaje no supervisado para realizar clustering a los datos del sensor LIDAR del KITTI-dataset

Language: Jupyter Notebook - Size: 14 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

SolanaO/TDS_Blogs_Knowledge_Graph

Create a knowledge graph based on Towards Data Science blogs.

Language: Jupyter Notebook - Size: 2.41 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

pajaskowiak/dbcv

Density-Based Clustering Validation

Language: MATLAB - Size: 356 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

fredriko/metacurate-regularly

Finding the top news stories of 2022 among 54,000+ news on AI, ML, NLP, data science and related fields.

Language: HTML - Size: 10.8 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

darpan-jain/sentence-embedding-clustering

Repository for fine-tuning and clustering sentence embeddings for Food Items

Language: Jupyter Notebook - Size: 781 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Related Keywords
clustering 7 hdbscan 6 natural-language-processing 5 machine-learning 5 python 5 dbscan-clustering 4 umap-hdbscan 4 sentence-embeddings 3 kmeans-clustering 3 umap 2 sentence-transformers 2 nlp 2 cluster-analysis 2 clustering-evaluation 2 clustering-methods 2 density-based-clustering 2 sentiment-analysis 2 salesforcasting 2 random-forest 2 naive-bayes-classifier 2 gmm-clustering 2 feature-engineering 2 deep-learning 2 customer-segmentation 2 business-intelligence 2 bilstm-model 2 arima-forecasting 2 bert-embeddings 2 dbscan 2 reverse-engineering 1 pip 1 study 1 clustering-algorithm 1 clusters 1 clusters-detection 1 oops-in-python 1 jee 1 filtering 1 database 1 data-visualization 1 data-analysis 1 opencv 1 mtcnn 1 tensorflow-training 1 concurrent-programming 1 metacurate 1 metacurate-io 1 ml 1 plotly-express 1 topically 1 visualization 1 biomechanics 1 ctgan 1 explainable-ai 1 gait-analysis 1 generative-ai 1 parkinson-diagnosis 1 parkinson-disease 1 parkinsons-disease 1 shap 1 wearable 1 wearable-devices 1 xai 1 hdbscan-clustering 1 high-performance-computing 1 hnsw 1 hnswlib 1 minimum-spanning-trees 1 multiprocess 1 multiprocessing 1 parallel-computing 1 python3 1 transformer 1 siamese-network 1 siamese-neural-network 1 spotify 1 spotify-dataset 1 spotify-recommendations 1 graph-algorithms 1 knowledge-graph 1 data-science 1 ais-data 1 compression-algorithm 1 trajectory 1 deepsea 1 drone 1 object-detection 1 plankton 1 sahi 1 saliency-detection 1 uav 1 yolov11 1 yolov5 1 yolov8 1 latent-dirichlet-allocation 1 named-entity-recognition 1 nltk 1 amazon-sagemaker 1 catboost 1 dask 1