An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: topic-models

MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python - Size: 23.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6,834 - Forks: 829

cuongndc9/article-topic

🔎📰 Detecting topic for new article.

Language: Python - Size: 20.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 1

jonaschn/awesome-topic-models

✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)

Size: 53.7 KB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 96 - Forks: 8

bobxwu/TopMost

A Topic Modeling System Toolkit (ACL 2024 Demo)

Language: Jupyter Notebook - Size: 254 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 26

MIND-Lab/OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language: Python - Size: 168 MB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 770 - Forks: 113

keyATM/keyATM

An R package for Keyword Assisted Topic Models

Language: R - Size: 50.1 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 110 - Forks: 15

AnFreTh/STREAM

A versatile Python package engineered for seamless topic modeling, topic evaluation, and topic visualization. Ideal for text analysis, natural language processing (NLP), and research in the social sciences, STREAM simplifies the extraction, interpretation, and visualization of topics from large, complex datasets.

Language: Python - Size: 228 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 38 - Forks: 9

baidu/Familia

A Toolkit for Industrial Topic Modeling

Language: C++ - Size: 5.97 MB - Last synced at: 21 days ago - Pushed at: almost 4 years ago - Stars: 2,643 - Forks: 593

maximtrp/bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

Language: Cython - Size: 693 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 81 - Forks: 14

markoarnauto/biterm

Biterm Topic Model

Language: HTML - Size: 433 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 26

bab2min/tomotopy

Python package of Tomoto, the Topic Modeling Tool

Language: C++ - Size: 2.33 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 578 - Forks: 63

amazon-science/text_generation_diffusion_llm_topic

Topic Embedding, Text Generation and Modeling using diffusion

Language: Python - Size: 154 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 12 - Forks: 3

jayholster/jayholster.github.io

Size: 763 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

datquocnguyen/jLDADMM

A Java package for the LDA and DMM topic models

Language: Java - Size: 256 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 81 - Forks: 20

JonasRieger/ldaPrototype

Determine a Prototype from a number of runs of Latent Dirichlet Allocation.

Language: R - Size: 799 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

laserwave/topic_models

implemented : lsa, plsa, lda

Language: Python - Size: 6.65 MB - Last synced at: about 2 months ago - Pushed at: almost 9 years ago - Stars: 99 - Forks: 46

prrao87/topic-modelling

Comparing the scalability and quality of topic models in Gensim and PySpark

Language: Python - Size: 10.1 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

silviatti/topic-model-diversity

A collection of topic diversity measures for topic modeling

Language: Python - Size: 30.3 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 45 - Forks: 5

doug-friedman/topicdoc

Topic-Specific Diagnostics for LDA and CTM Topic Models

Language: R - Size: 594 KB - Last synced at: 11 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 0

lfmatosm/embedded-topic-model

A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM

Language: Python - Size: 4.28 MB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 84 - Forks: 8

JonasRieger/rollinglda

A rolling version of the Latent Dirichlet Allocation.

Language: R - Size: 1.19 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 3

BobXWu/Paper-Neural-Topic-Models

Papers of Neural Topic Models (NTMs)

Size: 76.2 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 46 - Forks: 4

bean5/paper-itoptalk-lda_lds_gc

Whitepaper on Topical LDA application on documents. Base corpus: LDS General Conference articles spanning decades. Built for a Ling 485 class.

Language: TeX - Size: 259 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bean5/paper-thesis

My published paper on the application of LDA on documents. Base corpus: Thousands of LDS General Conference articles spanning decades.

Language: TeX - Size: 1.09 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

bean5/paper-gc-tm-venue-entropy

Whitepaper on LDA Topic Models to compute topic entropy by year. Base corpus: LDS General Conference articles spanning decades.

Language: TeX - Size: 358 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

gcdunn/ntc_analytics_2020

NTC Analytics Summit 2020

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

BobXWu/TraCo

Code for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling (AAAI 2024)

Language: Python - Size: 6.03 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ahoho/kd-topic-models

Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"

Language: Python - Size: 124 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 4

goerlitz/nlp-topic-models

Application of topic models for topic extraction and similarity search

Language: Jupyter Notebook - Size: 735 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 16 - Forks: 1

JohannaRangel/FinalProject_YelpGoogleMaps

LABS Final Project Henry - Yelp_GoogleMaps - Roles: Data Engineer | Data Analyst | Machine Learning Engineer | Data Scientist

Language: HTML - Size: 10.8 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 4

amiekong/cross-lingual-retrieval

Implementing an English-Spanish Cross-Lingual Information Retrieval System With Topic Model Query Expansion

Language: HTML - Size: 1.67 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tonyjward/trends-in-data-science

The objective of this project is to monitor the trends in data science job opportunities. We achieve this through scraping of the jobserve website.

Language: R - Size: 6.37 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 3

polsci/colab-gensim-mallet

This repository is designed for students in DIGI405 at the University of Canterbury to do topic modeling through their browser using Google Colab. It is relevant for others who want to do topic modeling through a browser with their own corpus.

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: about 9 hours ago - Pushed at: almost 4 years ago - Stars: 18 - Forks: 14

Rochan-A/sptm

Sentence Topic Prediction using Topic Modeling

Language: Python - Size: 1.68 MB - Last synced at: 25 days ago - Pushed at: about 6 years ago - Stars: 6 - Forks: 3

EdmundDuntis/gensim Fork of piskvorky/gensim

Topic Modelling for Humans

Language: Python - Size: 60.6 MB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

rktamplayo/AutoSense

[AAAI2019] AutoSense Model for Word Sense Induction

Language: Java - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 13 - Forks: 5

inurutdinov/eaa

A Generative Probabilistic Model for NLP

Language: Python - Size: 149 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 7

laserwave/jst

Joint Sentiment/Topic Model

Language: Java - Size: 1.85 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 3

saurabhmathur96/gutenberg-stories

a collection of short stories from project gutenberg

Language: HTML - Size: 33 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

plkmo/general-topic-classifier

Topic classifier based on MPNet, for 16 general topics

Language: Python - Size: 301 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BinFuPKU/AdvancedNLP

I have implemented the common operations in NLP domain (实现NLP中各种常规操作,如分词、句法、命名实体识别、语义话题模型、爬虫、ElasticSearch和Faiss向量检索,huggingface-transformers完成各种任务,2023)

Language: Jupyter Notebook - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

jdenes/TopicEmbeddings

An open-source framework to create and test document embeddings using topic models.

Language: Python - Size: 208 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

madhurima-nath/topicModeling

contains notebooks on topic modeling, spark and pandas implementation

Language: Jupyter Notebook - Size: 5.61 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

m-niemeyer/handwritten-digits-with-topic-modelling

This work shows an example of how handwritten digits can be learnt purely from data with the topic modelling concept

Language: Python - Size: 4.88 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

paozer/visualization_of_topic_models

Bachelor thesis @ KIT

Language: Python - Size: 1.74 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

IntelligentSystemsLaboratory/JGI-PURE-Challenge

A competition to identify, analyse and visualise interdisciplinary research at UoB using the PURE dataset.

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 2

christianrfg/tm_metrics

Quality Metrics for Topic Modeling

Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 2

mohity5/QUXCon23_TopicModels

Companion repository for the demo presented in the session Topic Models: A tool for uncovering hidden themes in data at Quant UX Con 23.

Language: Jupyter Notebook - Size: 347 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

BobXWu/ECRTM

Code for Effective Neural Topic Modeling with Embedding Clustering Regularization (ICML2023)

Language: Python - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 0

arsena-k/discourse_atoms

How are topics encoded in semantic space? Repository to accompany PNAS article: https://www.pnas.org/doi/10.1073/pnas.2108801119

Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 5

flyingflying/data-mining-v1

数据挖掘示例项目

Language: HTML - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

kzhai/PyLDA

A Latent Dirichlet Allocation implementation in Python.

Language: Python - Size: 72.9 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 52 - Forks: 22

abhishek9sharma/TwitterAnalysis

Python Notebooks for Collecting Tweets and Analyze their text using various text classification and clustering techniques

Language: Jupyter Notebook - Size: 979 KB - Last synced at: over 2 years ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 4

suyunu/theme-supervised-nmf

Theme Supervised Nonnegative Matrix Factorization

Language: Python - Size: 3.66 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

cs60050/TeamGabru

The official repository of TeamGabru.

Language: Python - Size: 97.5 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 3

usf-portal/his4936-dh1-course-workbook

Digital Workbook for HIS4936@University of South Florida

Language: HTML - Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

zengjichuan/DTDMN

Dynamic Dynamic Topic-Discourse Memory Networks (DTDMN)

Language: Python - Size: 4 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 1

polsci/binder-gensim-mallet

This repository is designed for students in DIGI405 at the University of Canterbury to do topic modeling through their browser using Binder. It is relevant for others who want to do topic modeling through a browser with their own corpus.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: about 9 hours ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

bobonovski/gotm

Topic Models in Go

Language: Go - Size: 47.9 KB - Last synced at: 12 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 4

c0reyes/TextMiningGUI

Text Mining and Analysis with Biplots.

Language: R - Size: 2.27 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

Christoph/robics

Automatic detection of robust parametrizations for LDA and NMF. Compatible with scikit-learn and gensim.

Language: Python - Size: 126 KB - Last synced at: 5 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

PuzaTech/Fugue

A research package for topic modeling

Language: Java - Size: 5.15 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

tteofili/jtm

tool for extraction of topics from jira issues

Language: Java - Size: 87.2 MB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 2

kzhai/PyLLDA

A Labeled Latent Dirichlet Allocation implementation in Python.

Language: Python - Size: 7.22 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

AndrewRPorter/wandering-analysis

Language: Python - Size: 28.3 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

durgeshbhagat/topic_interpretability Fork of jhlau/topic_interpretability

Computation of the semantic interpretability of topics produced by topic models using Observed coherence and automated word intrusion

Language: Roff - Size: 10.9 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Htiango/Topic-Model-Tutorial

A tutorial about practical usages of topic model.

Language: Jupyter Notebook - Size: 2.68 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

WING-NUS/lda2vec Fork of jethrokuan/lda2vec

Language: Python - Size: 34.2 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

casillas-qf/iir Fork of shuyo/iir

Machine Learning / Natural Language Processing / Information Retrieval

Language: Python - Size: 1.72 MB - Last synced at: 7 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

haripo/pokemon-lda

Categorizing pokemons using LDA

Language: JavaScript - Size: 1.42 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Keywords
topic-models 70 topic-modeling 41 lda 26 nlp 20 latent-dirichlet-allocation 15 natural-language-processing 13 gensim 10 python 9 machine-learning 8 topic-model 7 neural-topic-models 6 text-mining 5 r 4 embeddings 4 topic 4 neural-topic-modeling 3 whitepaper 3 data-science 3 sentence-embeddings 3 nlp-library 3 topic-modelling 3 nlp-machine-learning 3 visualization 3 general-conference 3 research 3 topic-modeling-analysis 3 bayesian-inference 3 paper 3 mallet 3 gibbs-sampling 3 pyspark 2 data-mining 2 topicmodelling 2 topicmodeling 2 textdata 2 reliability 2 model-selection 2 analysis 2 deep-learning 2 transformers 2 dataset 2 variational-inference 2 bayesian-statistics 2 text 2 text-analysis 2 text-classification 2 topics 2 binder 2 computer-vision 2 sentiment-analysis 2 tutorial 2 modeling 2 nmf 2 latex 2 docker 2 ai 2 python-3 2 lds 2 author-topic-model 2 hierarchical-dirichlet-processes 2 latent-semantic-analysis 2 non-negative-matrix-factorization 2 sentence-lda 2 evaluation-metrics 2 cython 2 document-embedding 1 spark-nlp 1 lda-evaluation 1 matplotlib 1 handwritten-digit-recognition 1 unsupervised-learning 1 bachelor-thesis 1 karlsruhe-institute-of-technology 1 streaming 1 challenge 1 competition 1 uob 1 data-analysis 1 visualisation 1 lsa 1 lsi 1 metrics 1 scikit-learn 1 digital-humanities 1 npmi 1 pokemon 1 hdp 1 pokedex 1 stm 1 lda-model 1 classification-model 1 classifier-model 1 clustering 1 distributional-semantics 1 topic-classification 1 video-surveillance 1 crawler 1 topic-interpretability 1 topic-chorence 1 elasticsearch 1