Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: text-classification

hankcs/HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Language: Python - Size: 69.5 MB - Last synced: 6 days ago - Pushed: about 1 month ago - Stars: 32,516 - Forks: 9,601

explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language: Python - Size: 194 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 28,659 - Forks: 4,274

brightmart/nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Size: 4.01 MB - Last synced: about 2 months ago - Pushed: 12 months ago - Stars: 9,089 - Forks: 1,526

brightmart/text_classification

all kinds of text classification models and more with deep learning

Language: Python - Size: 14.1 MB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 7,723 - Forks: 2,579

microsoft/nlp-recipes 📦

Natural Language Processing Best Practices & Examples

Language: Python - Size: 46.5 MB - Last synced: about 15 hours ago - Pushed: over 1 year ago - Stars: 6,335 - Forks: 914

gaussic/text-classification-cnn-rnn

CNN-RNN中文文本分类,基于TensorFlow

Language: Python - Size: 700 KB - Last synced: 6 months ago - Pushed: about 5 years ago - Stars: 3,999 - Forks: 1,463

ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language: Python - Size: 20.1 MB - Last synced: 8 days ago - Pushed: 2 months ago - Stars: 3,997 - Forks: 722

snipsco/snips-nlu

Snips Python library to extract meaning from text

Language: Python - Size: 19.3 MB - Last synced: 7 days ago - Pushed: 12 months ago - Stars: 3,867 - Forks: 516

CLUEbenchmark/CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

Language: Python - Size: 8.87 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 3,772 - Forks: 581

JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

Language: Scala - Size: 1.47 GB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 3,716 - Forks: 702

catalyst-team/catalyst

Accelerated deep learning R&D

Language: Python - Size: 52.6 MB - Last synced: about 6 hours ago - Pushed: 2 months ago - Stars: 3,234 - Forks: 385

fastnlp/fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

Language: Python - Size: 35.1 MB - Last synced: about 2 months ago - Pushed: 12 months ago - Stars: 3,032 - Forks: 454

BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language: Python - Size: 14.3 MB - Last synced: about 23 hours ago - Pushed: almost 3 years ago - Stars: 2,379 - Forks: 440

x4nth055/pythoncode-tutorials

The Python Code Tutorials

Language: Jupyter Notebook - Size: 312 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 2,010 - Forks: 1,850

alibaba/EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language: Python - Size: 19.9 MB - Last synced: 11 days ago - Pushed: 2 months ago - Stars: 1,955 - Forks: 247

kk7nc/Text_Classification

Text Classification Algorithms: A Survey

Language: Python - Size: 13.7 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1,774 - Forks: 588

xlang-ai/instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language: Python - Size: 170 MB - Last synced: 6 days ago - Pushed: 26 days ago - Stars: 1,724 - Forks: 126

yongzhuo/Keras-TextClassification

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

Language: Python - Size: 599 KB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 1,705 - Forks: 404

dipanjanS/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

Language: Jupyter Notebook - Size: 38.8 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 1,617 - Forks: 836

Delta-ML/delta

DELTA is a deep learning based natural language and speech processing platform.

Language: Python - Size: 59.5 MB - Last synced: about 15 hours ago - Pushed: about 1 month ago - Stars: 1,585 - Forks: 294

HarderThenHarder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Language: Jupyter Notebook - Size: 71.1 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 1,564 - Forks: 303

yongzhuo/nlp_xiaojiang

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用

Language: Python - Size: 23.4 MB - Last synced: 17 days ago - Pushed: over 2 years ago - Stars: 1,512 - Forks: 395

jasonwei20/eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

Language: Python - Size: 19.9 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 1,503 - Forks: 311

bfelbo/DeepMoji

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

Language: Python - Size: 108 MB - Last synced: about 2 months ago - Pushed: 8 months ago - Stars: 1,491 - Forks: 313

microsoft/NeuronBlocks

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

Language: Python - Size: 14.9 MB - Last synced: about 15 hours ago - Pushed: 10 months ago - Stars: 1,441 - Forks: 192

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 3.71 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 1,441 - Forks: 175

code-kern-ai/refinery

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

Language: Python - Size: 3.54 MB - Last synced: 1 day ago - Pushed: about 1 month ago - Stars: 1,365 - Forks: 64

lyeoni/nlp-tutorial

A list of NLP(Natural Language Processing) tutorials

Language: Jupyter Notebook - Size: 1.39 GB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 1,355 - Forks: 267

yao8839836/text_gcn

Graph Convolutional Networks for Text Classification. AAAI 2019

Language: Python - Size: 841 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 1,324 - Forks: 431

zhanlaoban/EDA_NLP_for_Chinese

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

Language: Python - Size: 22.5 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 1,265 - Forks: 232

920232796/bert_seq2seq

pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。

Language: Python - Size: 3.45 MB - Last synced: 12 days ago - Pushed: almost 2 years ago - Stars: 1,263 - Forks: 206

charlesXu86/Chatbot_CN

基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口

Size: 2.2 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 1,258 - Forks: 433

Hello-SimpleAI/chatgpt-comparison-detection

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Language: Python - Size: 53.7 KB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 1,184 - Forks: 114

kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Language: Jupyter Notebook - Size: 91.8 MB - Last synced: 5 days ago - Pushed: over 3 years ago - Stars: 1,120 - Forks: 781

obsei/obsei

Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .

Language: Python - Size: 16.2 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 1,083 - Forks: 151

richliao/textClassifier

Text classifier for Hierarchical Attention Networks for Document Classification

Language: Python - Size: 21.5 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 1,061 - Forks: 384

Tongjilibo/bert4torch

An elegent pytorch implement of transformers

Language: Python - Size: 8.35 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 985 - Forks: 129

brightmart/bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Language: Python - Size: 16 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 959 - Forks: 212

explosion/spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python - Size: 1.78 MB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 958 - Forks: 74

greyblake/whatlang-rs

Natural language detection library for Rust. Try demo online: https://whatlang.org/

Language: Rust - Size: 2.04 MB - Last synced: 20 days ago - Pushed: 2 months ago - Stars: 952 - Forks: 108

wikipedia2vec/wikipedia2vec

A tool for learning vector representations of words and entities from Wikipedia

Language: Python - Size: 2.41 MB - Last synced: about 1 hour ago - Pushed: 16 days ago - Stars: 922 - Forks: 101

rodrigopivi/Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!

Language: TypeScript - Size: 6.42 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 861 - Forks: 157

lonePatient/Bert-Multi-Label-Text-Classification

This repo contains a PyTorch implementation of a pretrained BERT model for multi-label text classification.

Language: Python - Size: 187 KB - Last synced: 6 days ago - Pushed: about 1 year ago - Stars: 836 - Forks: 208

smilelight/lightNLP

基于Pytorch和torchtext的自然语言处理深度学习框架。

Language: Python - Size: 49.3 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 823 - Forks: 212

JohnSnowLabs/nlu

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.

Language: Python - Size: 476 MB - Last synced: about 15 hours ago - Pushed: 1 day ago - Stars: 821 - Forks: 124

styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

Size: 120 KB - Last synced: 6 days ago - Pushed: almost 2 years ago - Stars: 817 - Forks: 76

ShawnyXiao/TextClassification-Keras

Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.

Language: Python - Size: 1.87 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 809 - Forks: 190

prakashpandey9/Text-Classification-Pytorch

Text classification using deep learning models in Pytorch

Language: Python - Size: 31.3 KB - Last synced: about 2 months ago - Pushed: over 5 years ago - Stars: 801 - Forks: 237

explosion/spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Language: Python - Size: 61.5 KB - Last synced: 5 days ago - Pushed: 10 months ago - Stars: 766 - Forks: 115

CLUEbenchmark/CLUEPretrainedModels

高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型

Language: Python - Size: 789 KB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 765 - Forks: 94

ilivans/tf-rnn-attention

Tensorflow implementation of attention mechanism for text classification tasks.

Language: Python - Size: 1.93 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 746 - Forks: 291

TobiasLee/Text-Classification

Implementation of papers for text classification task on DBpedia

Language: Python - Size: 76.2 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 739 - Forks: 200

meta-toolkit/meta

A Modern C++ Data Sciences Toolkit

Language: C++ - Size: 30.4 MB - Last synced: 17 days ago - Pushed: about 1 year ago - Stars: 686 - Forks: 232

daiquocnguyen/Graph-Transformer

Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (Pytorch and Tensorflow)

Language: Python - Size: 109 MB - Last synced: 4 days ago - Pushed: almost 2 years ago - Stars: 622 - Forks: 77

FreedomIntelligence/TextClassificationBenchmark

A Benchmark of Text Classification in PyTorch

Language: Python - Size: 1.76 MB - Last synced: 22 days ago - Pushed: 30 days ago - Stars: 592 - Forks: 137

jiegzhan/multi-class-text-classification-cnn-rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.

Language: Python - Size: 88.6 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 592 - Forks: 263

brightmart/sentiment_analysis_fine_grain

Multi-label Classification with BERT; Fine Grained Sentiment Analysis from AI challenger

Language: Jupyter Notebook - Size: 3.39 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 586 - Forks: 162

nuclia/nucliadb

NucliaDB, The AI Search database for RAG

Language: Python - Size: 34 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 569 - Forks: 45

hellonlp/classifier-multi-label

多标签文本分类,多标签分类,文本分类, multi-label, classifier, text classification, BERT, seq2seq,attention, multi-label-classification

Language: Python - Size: 3.48 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 569 - Forks: 132

NirantK/NLP_Quickbook

NLP in Python with Deep Learning

Language: Jupyter Notebook - Size: 3.15 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 560 - Forks: 231

xuyige/BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Language: Python - Size: 795 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 556 - Forks: 92

stepthom/text_mining_resources

Resources for learning about Text Mining and Natural Language Processing

Size: 707 KB - Last synced: 1 day ago - Pushed: over 1 year ago - Stars: 553 - Forks: 200

VinAIResearch/BERTweet

BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)

Language: Python - Size: 134 KB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 548 - Forks: 51

JayYip/m3tl

BERT for Multitask Learning

Language: Jupyter Notebook - Size: 29.1 MB - Last synced: 4 days ago - Pushed: about 1 year ago - Stars: 544 - Forks: 126

RandolphVI/Multi-Label-Text-Classification

About Muti-Label Text Classification Based on Neural Network.

Language: Python - Size: 451 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 543 - Forks: 147

shawroad/NLP_pytorch_project

Embedding, NMT, Text_Classification, Text_Generation, NER etc.

Language: Python - Size: 164 MB - Last synced: 6 months ago - Pushed: 11 months ago - Stars: 536 - Forks: 113

murray-z/text_analysis_tools

中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)

Language: Python - Size: 9.98 MB - Last synced: 6 months ago - Pushed: 8 months ago - Stars: 533 - Forks: 114

webis-de/small-text

Active Learning for Text Classification in Python

Language: Python - Size: 2 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 521 - Forks: 57

moon-hotel/BertWithPretrained

An implementation of the BERT model and its related downstream tasks based on the PyTorch framework

Language: Python - Size: 69.6 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 519 - Forks: 103

PracticingMan/chinese_text_cnn

TextCNN Pytorch实现 中文文本分类 情感分析

Language: Python - Size: 4.12 MB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 510 - Forks: 108

dongjun-Lee/text-classification-models-tf

Tensorflow implementations of Text Classification Models.

Language: Python - Size: 10.7 KB - Last synced: 22 days ago - Pushed: over 1 year ago - Stars: 505 - Forks: 167

gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language: OpenEdge ABL - Size: 384 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 503 - Forks: 149

EricFillion/happy-transformer

Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.

Language: Python - Size: 18.8 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 496 - Forks: 63

FXLP/MarkTool

DoTAT 是一款基于web、面向领域的通用文本标注工具,支持大规模实体标注、关系标注、事件标注、文本分类、基于字典匹配和正则匹配的自动标注以及用于实现归一化的标准名标注,同时也支持迭代标注、嵌套实体标注和嵌套事件标注。标注规范可自定义且同类型任务中可“一次创建多次复用”。通过分级实体集合扩大了实体类型的规模,并设计了全新高效的标注方式,提升了用户体验和标注效率。此外,本工具增加了审核环节,可对多人的标注结果进行一致性检验、自动合并和手动调整,提高了标注结果的准确率。

Language: Vue - Size: 854 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 480 - Forks: 83

jind11/TextFooler

A Model for Natural Language Attack on Text Classification and Inference

Language: Python - Size: 2.77 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 470 - Forks: 78

eBay/Sequence-Semantic-Embedding 📦

Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.

Language: Python - Size: 180 MB - Last synced: 22 days ago - Pushed: over 5 years ago - Stars: 459 - Forks: 122

yuanxiaosc/BERT-for-Sequence-Labeling-and-Text-Classification

This is the template code to use BERT for sequence lableing and text classification, in order to facilitate BERT for more tasks. Currently, the template code has included conll-2003 named entity identification, Snips Slot Filling and Intent Prediction.

Language: Python - Size: 2.75 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 456 - Forks: 96

nishitpatel01/Fake_News_Detection

Fake News Detection in Python

Language: Jupyter Notebook - Size: 12.6 MB - Last synced: 6 months ago - Pushed: 9 months ago - Stars: 453 - Forks: 307

sourcedexter/tfClassifier

Tensorflow based training and classification scripts for text, images, etc

Language: Python - Size: 82.9 MB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 452 - Forks: 277

jasoncao11/nlp-notebook

NLP 领域常见任务的实现,包括新词发现、以及基于pytorch的词向量、中文文本分类、实体识别、摘要文本生成、句子相似度判断、三元组抽取、预训练模型等。

Language: Python - Size: 53 MB - Last synced: 6 months ago - Pushed: almost 1 year ago - Stars: 440 - Forks: 100

shibing624/pytextclassifier

pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。

Language: Python - Size: 17.4 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 432 - Forks: 71

cjymz886/text-cnn

嵌入Word2vec词向量的CNN中文文本分类

Language: Python - Size: 22.6 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 430 - Forks: 117

jiegzhan/multi-class-text-classification-cnn

Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN (Convolutional Neural Network) and Word Embeddings on Tensorflow.

Language: Python - Size: 296 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 427 - Forks: 200

raghakot/keras-text

Text Classification Library in Keras

Language: Python - Size: 11.6 MB - Last synced: about 23 hours ago - Pushed: almost 6 years ago - Stars: 420 - Forks: 98

The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Language: Jupyter Notebook - Size: 50.2 MB - Last synced: about 19 hours ago - Pushed: 7 days ago - Stars: 414 - Forks: 43

kk7nc/RMDL

RMDL: Random Multimodel Deep Learning for Classification

Language: Python - Size: 223 MB - Last synced: 21 days ago - Pushed: about 1 year ago - Stars: 413 - Forks: 124

yongyehuang/zhihu-text-classification

[2017知乎看山杯 多标签 文本分类] ye组(第六名) 解题方案

Language: Jupyter Notebook - Size: 20.7 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 405 - Forks: 158

interpretml/interpret-text

A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.

Language: Python - Size: 10.3 MB - Last synced: 5 days ago - Pushed: 3 months ago - Stars: 402 - Forks: 67

airbnb/artificial-adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Language: Python - Size: 116 KB - Last synced: 8 days ago - Pushed: over 2 years ago - Stars: 391 - Forks: 57

Clarifai/clarifai-python

Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!

Language: Python - Size: 7.69 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 389 - Forks: 118

kermitt2/delft

a Deep Learning Framework for Text

Language: Python - Size: 832 MB - Last synced: 3 days ago - Pushed: 18 days ago - Stars: 385 - Forks: 65

uvipen/Hierarchical-attention-networks-pytorch

Hierarchical Attention Networks for document classification

Language: Python - Size: 48.5 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 365 - Forks: 101

hellonlp/sentiment-analysis

情感分析、文本分类、词典、bayes、sentiment analysis、TextCNN、classification、tensorflow、BERT、CNN、text classification

Language: Python - Size: 9.11 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 347 - Forks: 62

zzzDavid/ICDAR-2019-SROIE

ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction

Language: Python - Size: 272 MB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 345 - Forks: 132

sergioburdisso/pyss3

A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)

Language: Python - Size: 102 MB - Last synced: 8 days ago - Pushed: 9 months ago - Stars: 332 - Forks: 44

yongzhuo/Macadam

Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。

Language: Python - Size: 975 KB - Last synced: 28 days ago - Pushed: about 1 year ago - Stars: 324 - Forks: 38

phurwicz/hover

:speedboat: Label data at scale. Fun and precision included.

Language: Python - Size: 192 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 316 - Forks: 18

zhanlaoban/Transformers_for_Text_Classification

基于Transformers的文本分类

Language: Python - Size: 39.9 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 315 - Forks: 65

shibing624/nlp-tutorial

自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本分类,文本语义匹配,信息抽取,翻译,对话。

Language: Jupyter Notebook - Size: 2.69 MB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 296 - Forks: 56

RandolphVI/Hierarchical-Multi-Label-Text-Classification

The code of CIKM'19 paper《Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach》

Language: Python - Size: 300 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 295 - Forks: 68

Related Keywords
text-classification 3,080 nlp 1,053 machine-learning 856 natural-language-processing 584 python 527 deep-learning 471 sentiment-analysis 405 tensorflow 278 pytorch 269 bert 266 text-mining 230 nlp-machine-learning 209 classification 196 keras 158 text-processing 132 text-analysis 127 cnn 126 lstm 121 transformers 112 data-science 111 python3 109 naive-bayes-classifier 107 neural-network 98 logistic-regression 87 scikit-learn 87 word2vec 86 nltk 84 transformer 76 image-classification 73 tf-idf 73 artificial-intelligence 72 convolutional-neural-networks 70 rnn 70 sentiment-classification 70 sklearn 69 text 67 bert-model 62 text-generation 61 huggingface 61 word-embeddings 60 dataset 58 neural-networks 56 topic-modeling 56 ai 56 spacy 53 flask 53 named-entity-recognition 53 fasttext 51 svm 51 machine-learning-algorithms 48 jupyter-notebook 47 transfer-learning 45 ner 45 question-answering 45 embeddings 43 svm-classifier 43 bert-fine-tuning 42 language-model 42 data-mining 41 naive-bayes 41 huggingface-transformers 41 text-summarization 41 document-classification 40 random-forest 40 deep-neural-networks 40 ml 39 information-retrieval 38 keras-tensorflow 37 attention-mechanism 37 multi-label-classification 35 twitter 35 supervised-learning 35 pandas 34 multilabel-classification 33 tensorflow2 33 recurrent-neural-networks 33 bag-of-words 33 roberta 33 textcnn 33 kaggle 32 lstm-neural-networks 31 r 29 sentence-classification 28 text-preprocessing 28 docker 28 machine-translation 27 multiclass-classification 27 chatbot 27 bert-embeddings 26 streamlit 26 nltk-python 26 gru 26 graph-neural-networks 25 tokenization 25 fine-tuning 25 clustering 24 fastapi 24 text-clustering 24 attention 24 summarization 23