GitHub / juste97 / topic-modeling-pipeline
Topic-modeling pipeline for large datasets, built on BERTopic with UMAP and HDBSCAN.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/juste97%2Ftopic-modeling-pipeline
Stars: 1
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 87.8 MB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: over 1 year ago
Pushed at: over 1 year ago
Last synced at: over 1 year ago
Commit Stats
Commits: 27
Authors: 2
Mean commits per author: 13.5
Development Distribution Score: 0.444
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/juste97/topic-modeling-pipeline
Topics: bertopic, clustering, embedding, embeddings, hdbscan, language-processing, nlp, nlp-machine-learning, text-analysis, text-mining, text-processing, topic-modeling