GitHub topics: text-classification
codelion/adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
Language: Python - Size: 3.28 MB - Last synced at: about 4 hours ago - Pushed at: about 5 hours ago - Stars: 203 - Forks: 14

mazinsk2125/Adaptive-Message-Threat-Analyze
A full-stack app deployed on Render with environment-based dynamic port binding and optimized for seamless startup and scalability.
Language: Python - Size: 546 KB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 1 - Forks: 0

Cheetos19/EDA
Exploratory Data Analysis
Language: Jupyter Notebook - Size: 26.5 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

SamTheOneee1/kaggle-project-classification-of-tweets-from-northern-europe
Classifies 500K+ political tweets from Northern Europe using NLP and machine learning to analyze political discourse.
Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

Azie88/NLP-Huggingface-Covid-19-Tweet-Sentiment-Analysis
Fine Tuning text classification NLP models from huggingface with Covid-19 tweet data to build a model that classifies text based on Covid-19 sentiment
Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Tech-Nomadic-X/NLP-Sentiment-Task2-CodeTech
Sentiment Analysis on IMDB Reviews using Logistic Regression & TF-IDF | CodeTech Internship Task 2 A machine learning project that classifies movie reviews as positive or negative using Natural Language Processing. Built with scikit-learn, evaluated with accuracy, ROC-AUC, and confusion matrix.
Language: Jupyter Notebook - Size: 189 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

clown456957/IndoBERTvsSVM
This repository contains the final project (skripsi) for sentiment classification on Indonesian Twitter data using the hashtag #KaburAjaDulu. It explores the performance comparison between a fine-tuned IndoBERT model and traditional machine learning models (such as SVM with IndoBERT embeddings). Built with 🤗 Hugging Face Transformers.
Language: Jupyter Notebook - Size: 2.75 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

shaharoded/Israel-Palestine-Political-Affiliation-Text-Classification
A research we conducted aiming to create a model capable of identifying political affiliation regarding the Israel-Palestine conflict
Language: Jupyter Notebook - Size: 3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

pranayshekhar01/bert-sentiment-analyzer
A sentiment analysis web app powered by BERT, built with Streamlit. Classifies IMDb movie reviews as positive or negative with 93% accuracy.
Language: Jupyter Notebook - Size: 161 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

sergioburdisso/pyss3
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)
Language: Python - Size: 102 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 341 - Forks: 44

Lips7/Matcher
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.
Language: Rust - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 1

JustinJiang1994/chinese-text-classifier
基于tensorflow2.0中的keras进行中文的文本分类,实验数据为中文新闻分类文本cnews数据集。
Language: Python - Size: 129 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 43 - Forks: 15

Emriss0/Tech-Tweet
TechTweet is a microblogging platform for tech enthusiasts, allowing users to share short tech messages and engage in discussions. Join the community, post your thoughts, and connect with others! 🐙💻
Language: HTML - Size: 26.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

AdilShamim8/Sentiment-analysis
A machine learning project that decodes human emotions by analyzing text sentiment through an interactive web app.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
Language: Python - Size: 14.3 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 2,386 - Forks: 434

shaoncse/covid-tweet-nlp-analysis
📊 Sentiment classification and topic extraction from COVID-19 tweets using NLP techniques (TF-IDF, KMeans, Voting Classifiers). University project for text analytics and public opinion analysis.
Language: Jupyter Notebook - Size: 5.21 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ieg-dhr/NLP-Course4Humanities_2024
This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and applies NLP methods to them. NLP tasks: Tokenization, Lemmatization, TF-IDF, Part-of-speech tagging, semantic search with transformers, article extraction and OCR post-correction with LLMs, NER and text classification
Language: Jupyter Notebook - Size: 61.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 17 - Forks: 6

microsoft/nlp-recipes 📦
Natural Language Processing Best Practices & Examples
Language: Python - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 6,410 - Forks: 919

dayyass/text-classification-baseline
Pipeline for fast building text classification TF-IDF + LogReg baselines.
Language: Python - Size: 1.56 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 62 - Forks: 4

Zoubyr/traditional-machine-learning
Geleneksel Makine Öğrenmesi Yöntemleri ile Çalışmalarım
Language: Jupyter Notebook - Size: 320 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

nuclia/nucliadb
NucliaDB, The AI Search database for RAG
Language: Python - Size: 39.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 695 - Forks: 53

vanhai1231/phobert-vi-comment
Finetune mô hình PhoBERT cho phân loại comment trên không gian mạng
Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Size: 3.91 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 9,727 - Forks: 1,559

DzmitryPihulski/MachineLearningUniversityProject
University Research Project in one class classification
Language: Python - Size: 829 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Language: Python - Size: 194 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 31,699 - Forks: 4,508

Sravyatogarla/NLP-RNN-KMeans-Project
NLP mini-projects using Deep Learning & Machine Learning: IMDB sentiment classification using RNN and BBC News clustering using KMeans with multiple vectorization techniques.
Language: Jupyter Notebook - Size: 796 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python - Size: 39.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,568 - Forks: 408

allemandi/embed-classify-web
Text classification web app using CSV input and word embeddings (all-MiniLM-L6-v2).
Language: JavaScript - Size: 7.12 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

enessah00/adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
Language: Scala - Size: 3.4 GB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,988 - Forks: 726

massimoaria/tall
Text Analysis for aLL
Language: R - Size: 63.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 20 - Forks: 6

vyshnavidevi11/Alert_Text_Detector
Alert Text Detector is an NLP-based model that detects alert messages from social media posts. It is built using BERTweet Base and trained on a dataset of 23,000 tweets (alert & non-alert). The model flags emergency-related messages and classifies tweets based on textual content.
Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

phurwicz/hover
:speedboat: Label data at scale. Fun and precision included.
Language: Python - Size: 294 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 327 - Forks: 19

catalyst-team/catalyst
Accelerated deep learning R&D
Language: Python - Size: 52.6 MB - Last synced at: about 10 hours ago - Pushed at: about 1 year ago - Stars: 3,355 - Forks: 395

Willgnner-Santos/IT-Residence
TJGO Information Technology Residency
Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 716 - Forks: 88

sun-vonxu/emotion-detection-nlp
# Emotion Detection from Text using NLP## 📘 Project OverviewThis project uses Natural Language Processing to detect emotions like joy, sadness, anger, and fear from text. It compares traditional machine learning and deep learning models, ensuring robust evaluation. ## Objectives- Build a strong emotion classifier using public datasets.- Anal
Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

AIVU2026/TextMining-project
Text Mining project - Artificial Intelligence, Vrije University,2025
Language: Jupyter Notebook - Size: 3.99 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

alessiopittiglio/mm-argfallacy
Language: Python - Size: 26.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

tiansztiansz/python-data-science
b站 AI日日新 不定期更新使用Python框架完成机器学习、深度学习、数据科学任务
Language: Jupyter Notebook - Size: 4.78 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3 - Forks: 0

frank01101/channel_explorer
Managing channel/group data in instant messaging services (e.g., Telegram) and interacting with users.
Language: Python - Size: 105 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ljubogdan/tweet-emotion-classifier
A TensorFlow-powered Recurrent Neural Network (RNN) for multi-class emotion classification of tweets. Utilizes NLP techniques like tokenization, padding, and gradient descent optimization to analyze and classify emotions in a large tweet dataset.
Size: 3.91 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

MohamedMoubarakHussein/Automatic-Document-Classification-Categorization-By-Subject
Machine Learning-powered document classifier using SVM and TF-IDF vectorization. Automatically categorizes BBC news articles into 5 subjects with 98.65% accuracy.
Language: Jupyter Notebook - Size: 2.43 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Se00n00/NLP_Collection
This GitHub repository contains implementations of a wide range of NLP tasks, offering a comprehensive guide and reference to explore natural language processing.
Language: Jupyter Notebook - Size: 144 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Clarifai/clarifai-python
Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!
Language: Python - Size: 10.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 37 - Forks: 7

shaheennabi/Natural-Language-Processing-Practices-and-Mini-Projects
🎇 NLP Experiments 🎆 A hands-on collection of NLP experiments 💬, featuring models like RNN, LSTM, and Attention Mechanism. 🚀 Explore applications like text classification, sentiment analysis, and language generation 🌍. Continuously updated with new algorithms and research implementations! 🔥
Size: 8.79 KB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

ahsankhizar5/text-sentiment-analysis
A machine learning pipeline to classify IMDB reviews into positive or negative sentiment using TF-IDF and Logistic Regression.
Language: Python - Size: 2.93 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

NatLibFi/Annif
Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.
Language: Python - Size: 8.99 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 227 - Forks: 43

Ricardokevins/Kevinpro-NLP-demo
All NLP you Need Here. 目前包含15个NLP demo的pytorch实现(大量代码借鉴于其他开源项目,原先是自己玩的,后来干脆也开源出来)
Language: Python - Size: 613 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 276 - Forks: 55

Joseph-Yusuff/emotion-detection-nlp
A Natural Language Processing project for detecting emotions (e.g., joy, sadness, anger, fear) from text using traditional ML and deep learning models (MNB, SVM, BiLSTM, BERT)
Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

RicciLee44/VOC-Auto-Tagging-System
A smart labeling system for VOC (Voice of Customer) data. Automatically tags customer feedback with journey touchpoints, issue types, and sentiment. Supports batch processing, model training, and visualized reports — no coding required.
Language: Python - Size: 52.7 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

SergeyFilipov/covid-tweet-nlp-analysis
📊 Sentiment classification and topic extraction from COVID-19 tweets using NLP techniques (TF-IDF, KMeans, Voting Classifiers). University project for text analytics and public opinion analysis.
Language: Python - Size: 5.91 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

yongzhuo/Pytorch-NLU
中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolkit, supports multi class and multi label classification, text similsrity, text summary and NER.
Language: Python - Size: 379 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 346 - Forks: 50

Tongjilibo/bert4torch
An elegent pytorch implement of transformers
Language: Python - Size: 10.8 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 1,298 - Forks: 164

HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Language: Jupyter Notebook - Size: 71.1 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2,342 - Forks: 401

airbnb/artificial-adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Language: Python - Size: 116 KB - Last synced at: about 20 hours ago - Pushed at: over 3 years ago - Stars: 402 - Forks: 57

anum94/text-classification-word2vec
This a project I did with a university colleague for our seminar "Applied Deep Learning for Natural Language Processing" at TUM.
Language: Python - Size: 4.81 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

bgonzalezbustamante/TextClass-Benchmark
TextClass Benchmark Leaderboards
Language: Jupyter Notebook - Size: 148 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
Language: Python - Size: 8.87 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 4,320 - Forks: 624

kingabzpro/bbc-news-class-mlops
A complete MLOps project.
Language: Jupyter Notebook - Size: 2.63 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

Clarifai/clarifai-nodejs
Experience the power of Clarifai’s AI platform with the nodejs SDK. 🌟 Star to support our work!
Language: TypeScript - Size: 2.73 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 20 - Forks: 0

ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Language: Python - Size: 20 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 4,185 - Forks: 728

Charley-xiao/nlp-project
VeriScribbi: The Text Authenticator. A simple solution to distinguish human-written text from machine-generated content.
Language: Python - Size: 42.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

gaussic/text-classification-cnn-rnn
CNN-RNN中文文本分类,基于TensorFlow
Language: Python - Size: 700 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 4,232 - Forks: 1,466

prrao87/fine-grained-sentiment
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
Language: Python - Size: 1.6 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 170 - Forks: 72

hulat-group/semeval2023_task10_EDOS Fork of isegura/hulat_edos
Repository for the participation in SemEval-2023 Task 10 (EDOS)
Size: 540 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

label-sleuth/label-sleuth
Open source no-code system for text annotation and building of text classifiers
Language: Python - Size: 227 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 261 - Forks: 40

AnjaliDharmik/Fake-News-Detection
In an era of rapid digital information spread, distinguishing real from fake news is challenging. The Fake News Detection Dataset helps researchers and data scientists train models for accurate fake news detection.
Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 0

explosion/spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
Language: Python - Size: 1.79 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 1,254 - Forks: 97

vietnh1009/Hierarchical-attention-networks-pytorch
Hierarchical Attention Networks for document classification
Language: Python - Size: 48.5 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 396 - Forks: 104

KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
Language: Python - Size: 6.12 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 11

mancrurod/LinguaAnimae
Exploring emotions and meaning in Bible verses with NLP, transformers, and a custom Streamlit app.
Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

vinhkhuc/JFastText
Java interface for fastText
Language: Java - Size: 57.6 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 237 - Forks: 98

Ychen463/cyber-security-text-classification-nlp
Cyber is a Natural Language Processing tool focused on analyzing global cybersecurity policies. Utilizing both supervised and unsupervised machine learning, the project categorizes and compares strategies from over 75 countries.
Language: Jupyter Notebook - Size: 23.6 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 3

nehalvaghasiya/ml-nlp-projects
Collection of machine learning and NLP projects demonstrating various models and techniques.
Language: Jupyter Notebook - Size: 32.1 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

mustafaturan/omnicat-bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Language: Ruby - Size: 14.6 KB - Last synced at: 4 days ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 3

KashifMoin1410/Text-Sentiment-Analysis
This project analyzes tweet sentiments using both traditional machine learning (Logistic Regression, Ridge, XGBoost) and deep learning (LSTM) models. The workflow covers text preprocessing, feature engineering, model training, and evaluation. Logistic Regression achieved an R² score of 0.80, while the LSTM model reached ~76% validation accuracy.
Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

lyeoni/nlp-tutorial
A list of NLP(Natural Language Processing) tutorials
Language: Jupyter Notebook - Size: 1.39 GB - Last synced at: 9 days ago - Pushed at: about 5 years ago - Stars: 1,376 - Forks: 264

aftabshaikhraza/toxic-comment-classifier
Multi-label NLP model to classify toxic online comments
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

KanishkNavale/Text-Mining-with-TF-IDF-and-Cosine-Similarity
A simple python repository for developing perceptron based text mining involving dataset linguistics preprocessing for text classification and extracting similar text for a given query.
Language: Jupyter Notebook - Size: 7.34 MB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

chrisliatas/dsnd-ml-pipeline
ML pipeline to categorize emergency messages based on the needs communicated by the sender.
Language: Jupyter Notebook - Size: 2.97 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 0

udaykiran9392/fakenews_detection_using_ML
Implemented a machine learning model to detect fake news using Natural Language Processing techniques like TF-IDF and stemming. Trained multiple classifiers including Logistic Regression and PassiveAggressiveClassifier for accurate classification. This project showcases practical NLP skills for tackling misinformation in media.
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

yongzhuo/Keras-TextClassification
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Language: Python - Size: 601 KB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 1,807 - Forks: 404

ntumlgroup/LibMultiLabel
A library for multi-class and multi-label classification
Language: Python - Size: 1.81 MB - Last synced at: 16 days ago - Pushed at: 18 days ago - Stars: 6 - Forks: 9

TylerMommsen/text-target-ga
Genetic Algorithm Evolving To Solve a Phrase
Language: JavaScript - Size: 15.6 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

Pooh555/AI_vs_human_generated_content_models
Infomatrix 2025
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

interpretml/interpret-text
A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.
Language: Python - Size: 10.3 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 425 - Forks: 68

RTIInternational/SMART
Smarter Manual Annotation for Resource-constrained collection of Training data
Language: Python - Size: 129 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 228 - Forks: 32

mwydmuch/extremeText Fork of facebookresearch/fastText
Library for fast text representation and extreme classification.
Language: HTML - Size: 39.8 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 149 - Forks: 16

Alir3z4/python-stop-words
Get list of common stop words in various languages in Python
Language: Python - Size: 51.8 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 156 - Forks: 29

shibing624/pytextclassifier
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
Language: Python - Size: 17.2 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 510 - Forks: 75

AndyMalela/Finetune-BERT
Storing a simple project of me fine-tuning and training BERT for a text classification task.
Language: Python - Size: 2.42 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

EricFillion/happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
Language: Python - Size: 18.8 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 536 - Forks: 68

shibing624/nlp-tutorial
自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本分类,文本语义匹配,信息抽取,翻译,对话。
Language: Jupyter Notebook - Size: 2.69 MB - Last synced at: 14 days ago - Pushed at: about 3 years ago - Stars: 451 - Forks: 70

csinva/imodelsX
Interpret text data using LLMs (scikit-learn compatible).
Language: Python - Size: 35 MB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 165 - Forks: 26

webis-de/small-text
Active Learning for Text Classification in Python
Language: Python - Size: 3.03 MB - Last synced at: 11 days ago - Pushed at: 14 days ago - Stars: 615 - Forks: 71

juba/rainette
R implementation of the Reinert text clustering method
Language: R - Size: 15.5 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 56 - Forks: 7

fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Language: Python - Size: 35.1 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 3,132 - Forks: 449

MahtaFetrat/Persian-Informal-Text-Detector
Python package for detecting informal Persian text using regular expressions and rule-based methods
Language: Python - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

mim-solutions/bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
Language: Python - Size: 4.43 MB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 142 - Forks: 32
