GitHub topics: topic-models
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Language: Python - Size: 23.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6,834 - Forks: 829

cuongndc9/article-topic
🔎📰 Detecting topic for new article.
Language: Python - Size: 20.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 1

jonaschn/awesome-topic-models
✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)
Size: 53.7 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 96 - Forks: 8

bobxwu/TopMost
A Topic Modeling System Toolkit (ACL 2024 Demo)
Language: Jupyter Notebook - Size: 254 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 26

MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Language: Python - Size: 168 MB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 770 - Forks: 113

keyATM/keyATM
An R package for Keyword Assisted Topic Models
Language: R - Size: 50.1 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 110 - Forks: 15

AnFreTh/STREAM
A versatile Python package engineered for seamless topic modeling, topic evaluation, and topic visualization. Ideal for text analysis, natural language processing (NLP), and research in the social sciences, STREAM simplifies the extraction, interpretation, and visualization of topics from large, complex datasets.
Language: Python - Size: 228 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 38 - Forks: 9

baidu/Familia
A Toolkit for Industrial Topic Modeling
Language: C++ - Size: 5.97 MB - Last synced at: 21 days ago - Pushed at: almost 4 years ago - Stars: 2,643 - Forks: 593

maximtrp/bitermplus
Biterm Topic Model (BTM): modeling topics in short texts
Language: Cython - Size: 693 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 81 - Forks: 14

markoarnauto/biterm
Biterm Topic Model
Language: HTML - Size: 433 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 26

bab2min/tomotopy
Python package of Tomoto, the Topic Modeling Tool
Language: C++ - Size: 2.33 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 578 - Forks: 63

amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
Language: Python - Size: 154 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 12 - Forks: 3

jayholster/jayholster.github.io
Size: 763 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

datquocnguyen/jLDADMM
A Java package for the LDA and DMM topic models
Language: Java - Size: 256 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 81 - Forks: 20

JonasRieger/ldaPrototype
Determine a Prototype from a number of runs of Latent Dirichlet Allocation.
Language: R - Size: 799 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

laserwave/topic_models
implemented : lsa, plsa, lda
Language: Python - Size: 6.65 MB - Last synced at: about 2 months ago - Pushed at: almost 9 years ago - Stars: 99 - Forks: 46

prrao87/topic-modelling
Comparing the scalability and quality of topic models in Gensim and PySpark
Language: Python - Size: 10.1 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

silviatti/topic-model-diversity
A collection of topic diversity measures for topic modeling
Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 45 - Forks: 5

doug-friedman/topicdoc
Topic-Specific Diagnostics for LDA and CTM Topic Models
Language: R - Size: 594 KB - Last synced at: 11 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 0

lfmatosm/embedded-topic-model
A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM
Language: Python - Size: 4.28 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 84 - Forks: 8

JonasRieger/rollinglda
A rolling version of the Latent Dirichlet Allocation.
Language: R - Size: 1.19 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 3

BobXWu/Paper-Neural-Topic-Models
Papers of Neural Topic Models (NTMs)
Size: 76.2 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 46 - Forks: 4

bean5/paper-itoptalk-lda_lds_gc
Whitepaper on Topical LDA application on documents. Base corpus: LDS General Conference articles spanning decades. Built for a Ling 485 class.
Language: TeX - Size: 259 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bean5/paper-thesis
My published paper on the application of LDA on documents. Base corpus: Thousands of LDS General Conference articles spanning decades.
Language: TeX - Size: 1.09 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

bean5/paper-gc-tm-venue-entropy
Whitepaper on LDA Topic Models to compute topic entropy by year. Base corpus: LDS General Conference articles spanning decades.
Language: TeX - Size: 358 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

gcdunn/ntc_analytics_2020
NTC Analytics Summit 2020
Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

BobXWu/TraCo
Code for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling (AAAI 2024)
Language: Python - Size: 6.03 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ahoho/kd-topic-models
Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"
Language: Python - Size: 124 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 4

goerlitz/nlp-topic-models
Application of topic models for topic extraction and similarity search
Language: Jupyter Notebook - Size: 735 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 16 - Forks: 1

JohannaRangel/FinalProject_YelpGoogleMaps
LABS Final Project Henry - Yelp_GoogleMaps - Roles: Data Engineer | Data Analyst | Machine Learning Engineer | Data Scientist
Language: HTML - Size: 10.8 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 4

amiekong/cross-lingual-retrieval
Implementing an English-Spanish Cross-Lingual Information Retrieval System With Topic Model Query Expansion
Language: HTML - Size: 1.67 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tonyjward/trends-in-data-science
The objective of this project is to monitor the trends in data science job opportunities. We achieve this through scraping of the jobserve website.
Language: R - Size: 6.37 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 3

polsci/colab-gensim-mallet
This repository is designed for students in DIGI405 at the University of Canterbury to do topic modeling through their browser using Google Colab. It is relevant for others who want to do topic modeling through a browser with their own corpus.
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: about 9 hours ago - Pushed at: almost 4 years ago - Stars: 18 - Forks: 14

Rochan-A/sptm
Sentence Topic Prediction using Topic Modeling
Language: Python - Size: 1.68 MB - Last synced at: 25 days ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 3

EdmundDuntis/gensim Fork of piskvorky/gensim
Topic Modelling for Humans
Language: Python - Size: 60.6 MB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

rktamplayo/AutoSense
[AAAI2019] AutoSense Model for Word Sense Induction
Language: Java - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 13 - Forks: 5

inurutdinov/eaa
A Generative Probabilistic Model for NLP
Language: Python - Size: 149 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 7

laserwave/jst
Joint Sentiment/Topic Model
Language: Java - Size: 1.85 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 3

saurabhmathur96/gutenberg-stories
a collection of short stories from project gutenberg
Language: HTML - Size: 33 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

plkmo/general-topic-classifier
Topic classifier based on MPNet, for 16 general topics
Language: Python - Size: 301 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BinFuPKU/AdvancedNLP
I have implemented the common operations in NLP domain (实现NLP中各种常规操作,如分词、句法、命名实体识别、语义话题模型、爬虫、ElasticSearch和Faiss向量检索,huggingface-transformers完成各种任务,2023)
Language: Jupyter Notebook - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

jdenes/TopicEmbeddings
An open-source framework to create and test document embeddings using topic models.
Language: Python - Size: 208 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

madhurima-nath/topicModeling
contains notebooks on topic modeling, spark and pandas implementation
Language: Jupyter Notebook - Size: 5.61 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

m-niemeyer/handwritten-digits-with-topic-modelling
This work shows an example of how handwritten digits can be learnt purely from data with the topic modelling concept
Language: Python - Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

paozer/visualization_of_topic_models
Bachelor thesis @ KIT
Language: Python - Size: 1.74 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

IntelligentSystemsLaboratory/JGI-PURE-Challenge
A competition to identify, analyse and visualise interdisciplinary research at UoB using the PURE dataset.
Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 2

christianrfg/tm_metrics
Quality Metrics for Topic Modeling
Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 2

mohity5/QUXCon23_TopicModels
Companion repository for the demo presented in the session Topic Models: A tool for uncovering hidden themes in data at Quant UX Con 23.
Language: Jupyter Notebook - Size: 347 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

BobXWu/ECRTM
Code for Effective Neural Topic Modeling with Embedding Clustering Regularization (ICML2023)
Language: Python - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 0

arsena-k/discourse_atoms
How are topics encoded in semantic space? Repository to accompany PNAS article: https://www.pnas.org/doi/10.1073/pnas.2108801119
Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 5

flyingflying/data-mining-v1
数据挖掘示例项目
Language: HTML - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

kzhai/PyLDA
A Latent Dirichlet Allocation implementation in Python.
Language: Python - Size: 72.9 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 52 - Forks: 22

abhishek9sharma/TwitterAnalysis
Python Notebooks for Collecting Tweets and Analyze their text using various text classification and clustering techniques
Language: Jupyter Notebook - Size: 979 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 4

suyunu/theme-supervised-nmf
Theme Supervised Nonnegative Matrix Factorization
Language: Python - Size: 3.66 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

cs60050/TeamGabru
The official repository of TeamGabru.
Language: Python - Size: 97.5 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 3

usf-portal/his4936-dh1-course-workbook
Digital Workbook for HIS4936@University of South Florida
Language: HTML - Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

zengjichuan/DTDMN
Dynamic Dynamic Topic-Discourse Memory Networks (DTDMN)
Language: Python - Size: 4 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

polsci/binder-gensim-mallet
This repository is designed for students in DIGI405 at the University of Canterbury to do topic modeling through their browser using Binder. It is relevant for others who want to do topic modeling through a browser with their own corpus.
Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: about 9 hours ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

bobonovski/gotm
Topic Models in Go
Language: Go - Size: 47.9 KB - Last synced at: 12 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 4

c0reyes/TextMiningGUI
Text Mining and Analysis with Biplots.
Language: R - Size: 2.27 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

Christoph/robics
Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.
Language: Python - Size: 126 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

PuzaTech/Fugue
A research package for topic modeling
Language: Java - Size: 5.15 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

tteofili/jtm
tool for extraction of topics from jira issues
Language: Java - Size: 87.2 MB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 2

kzhai/PyLLDA
A Labeled Latent Dirichlet Allocation implementation in Python.
Language: Python - Size: 7.22 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

AndrewRPorter/wandering-analysis
Language: Python - Size: 28.3 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

durgeshbhagat/topic_interpretability Fork of jhlau/topic_interpretability
Computation of the semantic interpretability of topics produced by topic models using Observed coherence and automated word intrusion
Language: Roff - Size: 10.9 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Htiango/Topic-Model-Tutorial
A tutorial about practical usages of topic model.
Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

WING-NUS/lda2vec Fork of jethrokuan/lda2vec
Language: Python - Size: 34.2 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

casillas-qf/iir Fork of shuyo/iir
Machine Learning / Natural Language Processing / Information Retrieval
Language: Python - Size: 1.72 MB - Last synced at: 7 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

haripo/pokemon-lda
Categorizing pokemons using LDA
Language: JavaScript - Size: 1.42 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
