An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-classification

codelion/adaptive-classifier

A flexible, adaptive classification system for dynamic text classification

Language: Python - Size: 3.28 MB - Last synced at: about 4 hours ago - Pushed at: about 5 hours ago - Stars: 203 - Forks: 14

mazinsk2125/Adaptive-Message-Threat-Analyze

A full-stack app deployed on Render with environment-based dynamic port binding and optimized for seamless startup and scalability.

Language: Python - Size: 546 KB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 1 - Forks: 0

Cheetos19/EDA

Exploratory Data Analysis

Language: Jupyter Notebook - Size: 26.5 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

SamTheOneee1/kaggle-project-classification-of-tweets-from-northern-europe

Classifies 500K+ political tweets from Northern Europe using NLP and machine learning to analyze political discourse.

Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

Azie88/NLP-Huggingface-Covid-19-Tweet-Sentiment-Analysis

Fine-tuning text classification NLP models from Hugging Face with Covid-19 tweet data to build a model that classifies text by Covid-19 sentiment

Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Tech-Nomadic-X/NLP-Sentiment-Task2-CodeTech

Sentiment Analysis on IMDB Reviews using Logistic Regression & TF-IDF | CodeTech Internship Task 2. A machine learning project that classifies movie reviews as positive or negative using Natural Language Processing. Built with scikit-learn, evaluated with accuracy, ROC-AUC, and confusion matrix.

Language: Jupyter Notebook - Size: 189 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

clown456957/IndoBERTvsSVM

This repository contains the final project (skripsi) for sentiment classification on Indonesian Twitter data using the hashtag #KaburAjaDulu. It explores the performance comparison between a fine-tuned IndoBERT model and traditional machine learning models (such as SVM with IndoBERT embeddings). Built with 🤗 Hugging Face Transformers.

Language: Jupyter Notebook - Size: 2.75 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

shaharoded/Israel-Palestine-Political-Affiliation-Text-Classification

A research project we conducted, aiming to create a model capable of identifying political affiliation regarding the Israel-Palestine conflict

Language: Jupyter Notebook - Size: 3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

pranayshekhar01/bert-sentiment-analyzer

A sentiment analysis web app powered by BERT, built with Streamlit. Classifies IMDb movie reviews as positive or negative with 93% accuracy.

Language: Jupyter Notebook - Size: 161 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

sergioburdisso/pyss3

A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI)

Language: Python - Size: 102 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 341 - Forks: 44

Lips7/Matcher

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust.

Language: Rust - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 1

JustinJiang1994/chinese-text-classifier

Chinese text classification using Keras in TensorFlow 2.0; the experimental data is the cnews Chinese news classification dataset.

Language: Python - Size: 129 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 43 - Forks: 15

Emriss0/Tech-Tweet

TechTweet is a microblogging platform for tech enthusiasts, allowing users to share short tech messages and engage in discussions. Join the community, post your thoughts, and connect with others! 🐙💻

Language: HTML - Size: 26.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

AdilShamim8/Sentiment-analysis

A machine learning project that decodes human emotions by analyzing text sentiment through an interactive web app.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language: Python - Size: 14.3 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 2,386 - Forks: 434

shaoncse/covid-tweet-nlp-analysis

📊 Sentiment classification and topic extraction from COVID-19 tweets using NLP techniques (TF-IDF, KMeans, Voting Classifiers). University project for text analytics and public opinion analysis.

Language: Jupyter Notebook - Size: 5.21 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ieg-dhr/NLP-Course4Humanities_2024

This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and applies NLP methods to them. NLP tasks: Tokenization, Lemmatization, TF-IDF, Part-of-speech tagging, semantic search with transformers, article extraction and OCR post-correction with LLMs, NER and text classification

Language: Jupyter Notebook - Size: 61.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 17 - Forks: 6

microsoft/nlp-recipes 📦

Natural Language Processing Best Practices & Examples

Language: Python - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 6,410 - Forks: 919

dayyass/text-classification-baseline

Pipeline for fast building text classification TF-IDF + LogReg baselines.

Language: Python - Size: 1.56 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 62 - Forks: 4

Zoubyr/traditional-machine-learning

My work with traditional machine learning methods

Language: Jupyter Notebook - Size: 320 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

nuclia/nucliadb

NucliaDB, The AI Search database for RAG

Language: Python - Size: 39.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 695 - Forks: 53

vanhai1231/phobert-vi-comment

Fine-tuning the PhoBERT model to classify online comments

Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

brightmart/nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

Size: 3.91 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 9,727 - Forks: 1,559

DzmitryPihulski/MachineLearningUniversityProject

University research project on one-class classification

Language: Python - Size: 829 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language: Python - Size: 194 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 31,699 - Forks: 4,508

Sravyatogarla/NLP-RNN-KMeans-Project

NLP mini-projects using Deep Learning & Machine Learning: IMDB sentiment classification using RNN and BBC News clustering using KMeans with multiple vectorization techniques.

Language: Jupyter Notebook - Size: 796 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 39.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,568 - Forks: 408

allemandi/embed-classify-web

Text classification web app using CSV input and word embeddings (all-MiniLM-L6-v2).

Language: JavaScript - Size: 7.12 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

enessah00/adaptive-classifier

A flexible, adaptive classification system for dynamic text classification

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

Language: Scala - Size: 3.4 GB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,988 - Forks: 726

massimoaria/tall

Text Analysis for aLL

Language: R - Size: 63.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 20 - Forks: 6

vyshnavidevi11/Alert_Text_Detector

Alert Text Detector is an NLP-based model that detects alert messages from social media posts. It is built using BERTweet Base and trained on a dataset of 23,000 tweets (alert & non-alert). The model flags emergency-related messages and classifies tweets based on textual content.

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

phurwicz/hover

Label data at scale. Fun and precision included.

Language: Python - Size: 294 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 327 - Forks: 19

catalyst-team/catalyst

Accelerated deep learning R&D

Language: Python - Size: 52.6 MB - Last synced at: about 10 hours ago - Pushed at: about 1 year ago - Stars: 3,355 - Forks: 395

Willgnner-Santos/IT-Residence

TJGO Information Technology Residency

Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 716 - Forks: 88

sun-vonxu/emotion-detection-nlp

Emotion Detection from Text using NLP. Project overview: This project uses Natural Language Processing to detect emotions like joy, sadness, anger, and fear from text. It compares traditional machine learning and deep learning models, ensuring robust evaluation. Objectives: Build a strong emotion classifier using public datasets. Anal…

Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

AIVU2026/TextMining-project

Text Mining project - Artificial Intelligence, Vrije University, 2025

Language: Jupyter Notebook - Size: 3.99 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

alessiopittiglio/mm-argfallacy

Language: Python - Size: 26.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

tiansztiansz/python-data-science

Bilibili channel AI日日新: irregularly updated examples of using Python frameworks for machine learning, deep learning, and data science tasks

Language: Jupyter Notebook - Size: 4.78 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3 - Forks: 0

frank01101/channel_explorer

Managing channel/group data in instant messaging services (e.g., Telegram) and interacting with users.

Language: Python - Size: 105 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ljubogdan/tweet-emotion-classifier

A TensorFlow-powered Recurrent Neural Network (RNN) for multi-class emotion classification of tweets. Utilizes NLP techniques like tokenization, padding, and gradient descent optimization to analyze and classify emotions in a large tweet dataset.

Size: 3.91 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

MohamedMoubarakHussein/Automatic-Document-Classification-Categorization-By-Subject

Machine Learning-powered document classifier using SVM and TF-IDF vectorization. Automatically categorizes BBC news articles into 5 subjects with 98.65% accuracy.

Language: Jupyter Notebook - Size: 2.43 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Se00n00/NLP_Collection

This GitHub repository contains implementations of a wide range of NLP tasks, offering a comprehensive guide and reference to explore natural language processing.

Language: Jupyter Notebook - Size: 144 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Clarifai/clarifai-python

Experience the power of Clarifai’s AI platform with the Python SDK. 🌟 Star to support our work!

Language: Python - Size: 10.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 37 - Forks: 7

shaheennabi/Natural-Language-Processing-Practices-and-Mini-Projects

🎇 NLP Experiments 🎆 A hands-on collection of NLP experiments 💬, featuring models like RNN, LSTM, and Attention Mechanism. 🚀 Explore applications like text classification, sentiment analysis, and language generation 🌍. Continuously updated with new algorithms and research implementations! 🔥

Size: 8.79 KB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

ahsankhizar5/text-sentiment-analysis

A machine learning pipeline to classify IMDB reviews into positive or negative sentiment using TF-IDF and Logistic Regression.

Language: Python - Size: 2.93 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

NatLibFi/Annif

Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.

Language: Python - Size: 8.99 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 227 - Forks: 43

Ricardokevins/Kevinpro-NLP-demo

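All NLP you Need Here. Currently contains PyTorch implementations of 15 NLP demos (much of the code is borrowed from other open source projects; it started as a personal playground and was later open-sourced).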

Language: Python - Size: 613 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 276 - Forks: 55

Joseph-Yusuff/emotion-detection-nlp

A Natural Language Processing project for detecting emotions (e.g., joy, sadness, anger, fear) from text using traditional ML and deep learning models (MNB, SVM, BiLSTM, BERT)

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

RicciLee44/VOC-Auto-Tagging-System

A smart labeling system for VOC (Voice of Customer) data. Automatically tags customer feedback with journey touchpoints, issue types, and sentiment. Supports batch processing, model training, and visualized reports — no coding required.

Language: Python - Size: 52.7 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

SergeyFilipov/covid-tweet-nlp-analysis

📊 Sentiment classification and topic extraction from COVID-19 tweets using NLP techniques (TF-IDF, KMeans, Voting Classifiers). University project for text analytics and public opinion analysis.

Language: Python - Size: 5.91 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

yongzhuo/Pytorch-NLU

Chinese text classification and sequence labeling toolkit (PyTorch). Supports multi-class and multi-label classification of long and short Chinese texts, text similarity, and sequence labeling tasks such as Chinese named entity recognition (NER), part-of-speech tagging, word segmentation, and extractive text summarization.

Language: Python - Size: 379 KB - Last synced at: 10 days ago - Pushed at: 11 months ago - Stars: 346 - Forks: 50

Tongjilibo/bert4torch

An elegant PyTorch implementation of transformers

Language: Python - Size: 10.8 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 1,298 - Forks: 164

HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Language: Jupyter Notebook - Size: 71.1 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2,342 - Forks: 401

airbnb/artificial-adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Language: Python - Size: 116 KB - Last synced at: about 20 hours ago - Pushed at: over 3 years ago - Stars: 402 - Forks: 57

anum94/text-classification-word2vec

This is a project I did with a university colleague for our seminar "Applied Deep Learning for Natural Language Processing" at TUM.

Language: Python - Size: 4.81 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

bgonzalezbustamante/TextClass-Benchmark

TextClass Benchmark Leaderboards

Language: Jupyter Notebook - Size: 148 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

CLUEbenchmark/CLUEDatasetSearch

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Language: Python - Size: 8.87 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 4,320 - Forks: 624

kingabzpro/bbc-news-class-mlops

A complete MLOps project.

Language: Jupyter Notebook - Size: 2.63 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

Clarifai/clarifai-nodejs

Experience the power of Clarifai’s AI platform with the Node.js SDK. 🌟 Star to support our work!

Language: TypeScript - Size: 2.73 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 20 - Forks: 0

ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language: Python - Size: 20 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 4,185 - Forks: 728

Charley-xiao/nlp-project

VeriScribbi: The Text Authenticator. A simple solution to distinguish human-written text from machine-generated content.

Language: Python - Size: 42.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 0

gaussic/text-classification-cnn-rnn

CNN-RNN Chinese text classification, based on TensorFlow

Language: Python - Size: 700 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 4,232 - Forks: 1,466

prrao87/fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.

Language: Python - Size: 1.6 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 170 - Forks: 72

hulat-group/semeval2023_task10_EDOS Fork of isegura/hulat_edos

Repository for the participation in SemEval-2023 Task 10 (EDOS)

Size: 540 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

label-sleuth/label-sleuth

Open source no-code system for text annotation and building of text classifiers

Language: Python - Size: 227 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 261 - Forks: 40

AnjaliDharmik/Fake-News-Detection

In an era of rapid digital information spread, distinguishing real from fake news is challenging. The Fake News Detection Dataset helps researchers and data scientists train models for accurate fake news detection.

Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 0

explosion/spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python - Size: 1.79 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 1,254 - Forks: 97

vietnh1009/Hierarchical-attention-networks-pytorch

Hierarchical Attention Networks for document classification

Language: Python - Size: 48.5 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 396 - Forks: 104

KennethEnevoldsen/augmenty

Augmenty is an augmentation library based on spaCy for augmenting texts.

Language: Python - Size: 6.12 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 11

mancrurod/LinguaAnimae

Exploring emotions and meaning in Bible verses with NLP, transformers, and a custom Streamlit app.

Language: Jupyter Notebook - Size: 21.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

vinhkhuc/JFastText

Java interface for fastText

Language: Java - Size: 57.6 KB - Last synced at: 9 days ago - Pushed at: about 2 years ago - Stars: 237 - Forks: 98

Ychen463/cyber-security-text-classification-nlp

Cyber is a Natural Language Processing tool focused on analyzing global cybersecurity policies. Utilizing both supervised and unsupervised machine learning, the project categorizes and compares strategies from over 75 countries.

Language: Jupyter Notebook - Size: 23.6 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 3

nehalvaghasiya/ml-nlp-projects

Collection of machine learning and NLP projects demonstrating various models and techniques.

Language: Jupyter Notebook - Size: 32.1 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

mustafaturan/omnicat-bayes

Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)

Language: Ruby - Size: 14.6 KB - Last synced at: 4 days ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 3

KashifMoin1410/Text-Sentiment-Analysis

This project analyzes tweet sentiments using both traditional machine learning (Logistic Regression, Ridge, XGBoost) and deep learning (LSTM) models. The workflow covers text preprocessing, feature engineering, model training, and evaluation. Logistic Regression achieved an R² score of 0.80, while the LSTM model reached ~76% validation accuracy.

Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

lyeoni/nlp-tutorial

A list of NLP (Natural Language Processing) tutorials

Language: Jupyter Notebook - Size: 1.39 GB - Last synced at: 9 days ago - Pushed at: about 5 years ago - Stars: 1,376 - Forks: 264

aftabshaikhraza/toxic-comment-classifier

Multi-label NLP model to classify toxic online comments

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

KanishkNavale/Text-Mining-with-TF-IDF-and-Cosine-Similarity

A simple Python repository for developing perceptron-based text mining, involving linguistic preprocessing of datasets for text classification and extracting similar text for a given query.

Language: Jupyter Notebook - Size: 7.34 MB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

chrisliatas/dsnd-ml-pipeline

ML pipeline to categorize emergency messages based on the needs communicated by the sender.

Language: Jupyter Notebook - Size: 2.97 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 0

udaykiran9392/fakenews_detection_using_ML

Implemented a machine learning model to detect fake news using Natural Language Processing techniques like TF-IDF and stemming. Trained multiple classifiers including Logistic Regression and PassiveAggressiveClassifier for accurate classification. This project showcases practical NLP skills for tackling misinformation in media.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

yongzhuo/Keras-TextClassification

Chinese long-text classification, short-sentence classification, multi-label classification, and sentence-pair similarity with Keras; base classes for building character-, word-, and sentence-vector embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

Language: Python - Size: 601 KB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 1,807 - Forks: 404

ntumlgroup/LibMultiLabel

A library for multi-class and multi-label classification

Language: Python - Size: 1.81 MB - Last synced at: 16 days ago - Pushed at: 18 days ago - Stars: 6 - Forks: 9

TylerMommsen/text-target-ga

Genetic Algorithm Evolving To Solve a Phrase

Language: JavaScript - Size: 15.6 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

Pooh555/AI_vs_human_generated_content_models

Infomatrix 2025

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

interpretml/interpret-text

A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.

Language: Python - Size: 10.3 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 425 - Forks: 68

RTIInternational/SMART

Smarter Manual Annotation for Resource-constrained collection of Training data

Language: Python - Size: 129 MB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 228 - Forks: 32

mwydmuch/extremeText Fork of facebookresearch/fastText

Library for fast text representation and extreme classification.

Language: HTML - Size: 39.8 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 149 - Forks: 16

Alir3z4/python-stop-words

Get list of common stop words in various languages in Python

Language: Python - Size: 51.8 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 156 - Forks: 29

shibing624/pytextclassifier

pytextclassifier is a toolkit for text classification. Implements classification models including LR, Xgboost, TextCNN, FastText, TextRNN, and BERT, ready to use out of the box.

Language: Python - Size: 17.2 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 510 - Forks: 75

AndyMalela/Finetune-BERT

Storing a simple project of me fine-tuning and training BERT for a text classification task.

Language: Python - Size: 2.42 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

EricFillion/happy-transformer

Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.

Language: Python - Size: 18.8 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 536 - Forks: 68

shibing624/nlp-tutorial

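Natural language processing (NLP) tutorial covering word vectors, lexical analysis, pretrained language models, text classification, semantic text matching, information extraction, translation, and dialogue.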

Language: Jupyter Notebook - Size: 2.69 MB - Last synced at: 14 days ago - Pushed at: about 3 years ago - Stars: 451 - Forks: 70

csinva/imodelsX

Interpret text data using LLMs (scikit-learn compatible).

Language: Python - Size: 35 MB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 165 - Forks: 26

webis-de/small-text

Active Learning for Text Classification in Python

Language: Python - Size: 3.03 MB - Last synced at: 11 days ago - Pushed at: 14 days ago - Stars: 615 - Forks: 71

juba/rainette

R implementation of the Reinert text clustering method

Language: R - Size: 15.5 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 56 - Forks: 7

fastnlp/fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

Language: Python - Size: 35.1 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 3,132 - Forks: 449

MahtaFetrat/Persian-Informal-Text-Detector

Python package for detecting informal Persian text using regular expressions and rule-based methods

Language: Python - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 6 - Forks: 0

mim-solutions/bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.

Language: Python - Size: 4.43 MB - Last synced at: 11 days ago - Pushed at: 12 months ago - Stars: 142 - Forks: 32

Related Keywords
text-classification (3,659), nlp (1,256), machine-learning (1,041), natural-language-processing (718), python (673), deep-learning (558), sentiment-analysis (522), bert (321), tensorflow (320), pytorch (306), nlp-machine-learning (253), text-mining (248), classification (222), keras (178), text-processing (157), data-science (156), cnn (149), lstm (149), transformers (148), text-analysis (139), python3 (124), naive-bayes-classifier (122), scikit-learn (120), logistic-regression (118), neural-network (114), nltk (107), word2vec (98), artificial-intelligence (93), transformer (92), image-classification (91), tf-idf (89), sentiment-classification (88), text (86), bert-model (86), rnn (83), huggingface (82), ai (81), sklearn (81), text-generation (77), convolutional-neural-networks (76), flask (74), topic-modeling (73), named-entity-recognition (70), svm (66), neural-networks (65), spacy (64), dataset (62), word-embeddings (62), naive-bayes (58), machine-learning-algorithms (55), jupyter-notebook (55), fasttext (55), bert-fine-tuning (53), huggingface-transformers (52), ner (52), question-answering (51), random-forest (50), embeddings (50), pandas (49), text-summarization (49), transfer-learning (49), svm-classifier (48), language-model (47), streamlit (47), roberta (46), deep-neural-networks (46), ml (45), data-mining (45), bag-of-words (43), twitter (42), attention-mechanism (42), multi-label-classification (41), keras-tensorflow (41), document-classification (41), llm (41), supervised-learning (41), fine-tuning (41), text-preprocessing (40), kaggle (40), information-retrieval (40), multilabel-classification (40), docker (39), lstm-neural-networks (38), recurrent-neural-networks (38), tokenization (36), spam-detection (36), textcnn (35), fastapi (35), tensorflow2 (35), large-language-models (33), chatbot (33), tfidf (33), data-visualization (33), r (32), nltk-python (32), multiclass-classification (32), gru (31), text-clustering (31), sentence-classification (30), machine-translation (30)