An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: short-text

goamegah/pytorch-stc

PyTorch implementation of Self-training approch for short text clustering

Language: Python - Size: 16.9 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 8 - Forks: 0

goamegah/Short-Text-Clustering

Short text clustering methods through differents approaches

Language: Python - Size: 259 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

datquocnguyen/jLDADMM

A Java package for the LDA and DMM topic models

Language: Java - Size: 256 KB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 80 - Forks: 20

datquocnguyen/LFTM

Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)

Language: Java - Size: 9.02 MB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 178 - Forks: 59

qiang2100/STTM

Short Text Topic Modeling, JAVA

Language: Java - Size: 7.5 MB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 154 - Forks: 40

jannis-baum/biterm-topic-model Fork of xiaohuiyan/BTM

Fork of original code for Biterm Topic Model to provide closer to real-world use interfaces

Language: C++ - Size: 10.4 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

WHUIR/SATM

Our Java implementation of Self-Aggregation-Based Topic Model (SATM)

Language: Java - Size: 28.3 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 7

BobXWu/NQTM

Code for Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder (EMNLP2020).

Language: Python - Size: 267 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 40 - Forks: 5

BobXWu/TSCTM

Code for Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning (EMNLP2022)

Language: Roff - Size: 388 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 3

rwalk/gsdmm

GSDMM: Short text clustering

Language: Python - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 344 - Forks: 95

jacerong/normalesp

An open-source spell checker for texts written in Spanish, with a focus on tweets.

Language: Python - Size: 4.85 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 2

rwalk/gsdmm-rust

GSDMM: Short text clustering (Rust implementation)

Language: Rust - Size: 27.3 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 10

jacerong/TASS-2017

An open-source, top-ranked sentiment analysis system of Spanish tweets.

Language: Python - Size: 3.77 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

ShimShim46/HFT-CNN

Convolutional Neural Network based on Hierarchical Category Structure for Multi-label Short Text Categorization

Language: Python - Size: 2.72 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 111 - Forks: 25

WHUIR/PDMM

The java implementation of "Enhancing Topic Modeling for Short Texts with Auxiliary Word Embeddings" TOIS 2017, Chenliang Li, Yu Duan, Haoran Wang, Zhiqian Zhang, Aixin Sun, Zongyang Ma, https://dl.acm.org/citation.cfm?doid=3133943.3091108

Language: Java - Size: 18.3 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

rktamplayo/MicroASM

[ICDM2017] Aspect Sentiment Model for Micro Reviews

Language: Java - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 4

cwwdaniel/invoice-text-classification

Semantic Enrichment, Data Augmentation and Deep Learning for Boosting Invoice Text Classification Performance: A Novel Natural Language Processing Strategy

Size: 68.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ethanhezhao/MetaFTM

The code of the project that extends the paper "A Word Embeddings Informed Focused Topic Model"

Language: MATLAB - Size: 3.46 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

ethanhezhao/MIGA

MIGA is a short text clustering/aggregation topic model that leverages document metadata

Language: MATLAB - Size: 400 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

FKarl/short-text-classification

This repository contains code to reproduce the results in our paper "Transformers are Short Text Classifiers: A Study of Inductive Short Text Classifiers on Benchmarks and Real-world Datasets".

Language: Python - Size: 9.22 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 4

WHUIR/GSDMM

Our implementation of collapsed Gibbs Sampling algorithm for Dirichlet Multinomial Mixture model(GSDMM) (published in KDD 2014)

Language: Java - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

BobXWu/Multiterm-Topic-Model

Code for Short Text Topic Modeling with Flexible Word Patterns (IJCNN2019)

Language: Java - Size: 271 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 4

WHUIR/GPUDMM

The implementation of GPU-based Dirichlet Multinomial Mixture model (GPU-DMM) (published in SIGIR 2016)

Language: Java - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 7

johpetsc/invoices-classification

Electronic Invoices classification

Language: Python - Size: 61.5 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

johpetsc/ELINAC

Autoencoder Approach for Electronic Invoices Data Clustering

Language: Jupyter Notebook - Size: 1.96 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

WHUIR/BTM

Our implementation of Biterm Topic Model (BTM) (published in WWW 2013)

Language: Java - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

NLPIR-team/ShortTextClassification

A classification for text and short text.

Language: Java - Size: 17.5 MB - Last synced at: 15 days ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 3

lucaseliascrocha/Twitter-Topics-Identification

Final graduation project.

Language: Jupyter Notebook - Size: 28.5 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

lucaseliascrocha/PFC

Final graduation project. Working on short text topic identification.

Language: C++ - Size: 255 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

coditori/sylang

Sylang - minimal notes

Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1