Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: text-classification
hankcs/HanLP
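Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing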
Language: Python - Size: 69.5 MB - Last synced: 6 days ago - Pushed: about 1 month ago - Stars: 32,516 - Forks: 9,601
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Language: Python - Size: 194 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 28,659 - Forks: 4,274
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
Size: 4.01 MB - Last synced: about 2 months ago - Pushed: 12 months ago - Stars: 9,089 - Forks: 1,526
brightmart/text_classification
all kinds of text classification models and more with deep learning
Language: Python - Size: 14.1 MB - Last synced: 2 months ago - Pushed: 8 months ago - Stars: 7,723 - Forks: 2,579
microsoft/nlp-recipes 📦
Natural Language Processing Best Practices & Examples
Language: Python - Size: 46.5 MB - Last synced: about 15 hours ago - Pushed: over 1 year ago - Stars: 6,335 - Forks: 914
gaussic/text-classification-cnn-rnn
CNN-RNN Chinese text classification, based on TensorFlow
Language: Python - Size: 700 KB - Last synced: 6 months ago - Pushed: about 5 years ago - Stars: 3,999 - Forks: 1,463
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Language: Python - Size: 20.1 MB - Last synced: 8 days ago - Pushed: 2 months ago - Stars: 3,997 - Forks: 722
snipsco/snips-nlu
Snips Python library to extract meaning from text
Language: Python - Size: 19.3 MB - Last synced: 7 days ago - Pushed: 12 months ago - Stars: 3,867 - Forks: 516
CLUEbenchmark/CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Language: Python - Size: 8.87 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 3,772 - Forks: 581
JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
Language: Scala - Size: 1.47 GB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 3,716 - Forks: 702
catalyst-team/catalyst
Accelerated deep learning R&D
Language: Python - Size: 52.6 MB - Last synced: about 6 hours ago - Pushed: 2 months ago - Stars: 3,234 - Forks: 385
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Language: Python - Size: 35.1 MB - Last synced: about 2 months ago - Pushed: 12 months ago - Stars: 3,032 - Forks: 454
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
Language: Python - Size: 14.3 MB - Last synced: about 23 hours ago - Pushed: almost 3 years ago - Stars: 2,379 - Forks: 440
x4nth055/pythoncode-tutorials
The Python Code Tutorials
Language: Jupyter Notebook - Size: 312 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 2,010 - Forks: 1,850
alibaba/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Language: Python - Size: 19.9 MB - Last synced: 11 days ago - Pushed: 2 months ago - Stars: 1,955 - Forks: 247
kk7nc/Text_Classification
Text Classification Algorithms: A Survey
Language: Python - Size: 13.7 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1,774 - Forks: 588
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Language: Python - Size: 170 MB - Last synced: 6 days ago - Pushed: 26 days ago - Stars: 1,724 - Forks: 126
yongzhuo/Keras-TextClassification
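Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, capsule network (CapsuleNet), Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN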
Language: Python - Size: 599 KB - Last synced: 16 days ago - Pushed: 16 days ago - Stars: 1,705 - Forks: 404
dipanjanS/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Language: Jupyter Notebook - Size: 38.8 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 1,617 - Forks: 836
Delta-ML/delta
DELTA is a deep learning based natural language and speech processing platform.
Language: Python - Size: 59.5 MB - Last synced: about 15 hours ago - Pushed: about 1 month ago - Stars: 1,585 - Forks: 294
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Language: Jupyter Notebook - Size: 71.1 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 1,564 - Forks: 303
yongzhuo/nlp_xiaojiang
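Natural language processing (nlp): Xiaojiang robot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNET sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner, bert+bilstm+crf), data augmentation (text augment, data enhance), paraphrase and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, and keras-http-service calls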
Language: Python - Size: 23.4 MB - Last synced: 17 days ago - Pushed: over 2 years ago - Stars: 1,512 - Forks: 395
jasonwei20/eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
Language: Python - Size: 19.9 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 1,503 - Forks: 311
bfelbo/DeepMoji
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
Language: Python - Size: 108 MB - Last synced: about 2 months ago - Pushed: 8 months ago - Stars: 1,491 - Forks: 313
microsoft/NeuronBlocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Language: Python - Size: 14.9 MB - Last synced: about 15 hours ago - Pushed: 10 months ago - Stars: 1,441 - Forks: 192
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Python - Size: 3.71 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 1,441 - Forks: 175
code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Language: Python - Size: 3.54 MB - Last synced: 1 day ago - Pushed: about 1 month ago - Stars: 1,365 - Forks: 64
lyeoni/nlp-tutorial
A list of NLP (Natural Language Processing) tutorials
Language: Jupyter Notebook - Size: 1.39 GB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 1,355 - Forks: 267
yao8839836/text_gcn
Graph Convolutional Networks for Text Classification. AAAI 2019
Language: Python - Size: 841 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 1,324 - Forks: 431
zhanlaoban/EDA_NLP_for_Chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese corpora. NLP data augmentation. Paper reading notes.
Language: Python - Size: 22.5 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 1,265 - Forks: 232
920232796/bert_seq2seq
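A PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme; it now also handles automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and other tasks, supports the T5 model, and supports GPT2 for article continuation.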
Language: Python - Size: 3.45 MB - Last synced: 12 days ago - Pushed: almost 2 years ago - Stars: 1,263 - Forks: 206
charlesXu86/Chatbot_CN
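A chatbot for the finance and judicial domains (with some chit-chat capability). Its main modules include information extraction, NLU, NLG, and a knowledge graph; the front-end display is integrated with Django, and RESTful interfaces for nlp and kg have been wrapped.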
Size: 2.2 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 1,258 - Forks: 433
Hello-SimpleAI/chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
Language: Python - Size: 53.7 KB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 1,184 - Forks: 114
kavgan/nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Language: Jupyter Notebook - Size: 91.8 MB - Last synced: 5 days ago - Pushed: over 3 years ago - Stars: 1,120 - Forks: 781
obsei/obsei
Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more.
Language: Python - Size: 16.2 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 1,083 - Forks: 151
richliao/textClassifier
Text classifier for Hierarchical Attention Networks for Document Classification
Language: Python - Size: 21.5 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 1,061 - Forks: 384
Tongjilibo/bert4torch
An elegant PyTorch implementation of transformers
Language: Python - Size: 8.35 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 985 - Forks: 129
brightmart/bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Language: Python - Size: 16 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 959 - Forks: 212
explosion/spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
Language: Python - Size: 1.78 MB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 958 - Forks: 74
greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/
Language: Rust - Size: 2.04 MB - Last synced: 20 days ago - Pushed: 2 months ago - Stars: 952 - Forks: 108
wikipedia2vec/wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia
Language: Python - Size: 2.41 MB - Last synced: about 1 hour ago - Pushed: 16 days ago - Stars: 922 - Forks: 101
rodrigopivi/Chatito
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Language: TypeScript - Size: 6.42 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 861 - Forks: 157
lonePatient/Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for multi-label text classification.
Language: Python - Size: 187 KB - Last synced: 6 days ago - Pushed: about 1 year ago - Stars: 836 - Forks: 208
smilelight/lightNLP
A deep learning framework for natural language processing based on PyTorch and torchtext.
Language: Python - Size: 49.3 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 823 - Forks: 212
JohnSnowLabs/nlu
1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Language: Python - Size: 476 MB - Last synced: about 15 hours ago - Pushed: 1 day ago - Stars: 821 - Forks: 124
styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
Size: 120 KB - Last synced: 6 days ago - Pushed: almost 2 years ago - Stars: 817 - Forks: 76
ShawnyXiao/TextClassification-Keras
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Language: Python - Size: 1.87 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 809 - Forks: 190
prakashpandey9/Text-Classification-Pytorch
Text classification using deep learning models in Pytorch
Language: Python - Size: 31.3 KB - Last synced: about 2 months ago - Pushed: over 5 years ago - Stars: 801 - Forks: 237
explosion/spacy-streamlit
👑 spaCy building blocks and visualizers for Streamlit apps
Language: Python - Size: 61.5 KB - Last synced: 5 days ago - Pushed: 10 months ago - Stars: 766 - Forks: 115
CLUEbenchmark/CLUEPretrainedModels
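A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity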
Language: Python - Size: 789 KB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 765 - Forks: 94
ilivans/tf-rnn-attention
Tensorflow implementation of attention mechanism for text classification tasks.
Language: Python - Size: 1.93 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 746 - Forks: 291
TobiasLee/Text-Classification
Implementation of papers for text classification task on DBpedia
Language: Python - Size: 76.2 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 739 - Forks: 200
meta-toolkit/meta
A Modern C++ Data Sciences Toolkit
Language: C++ - Size: 30.4 MB - Last synced: 17 days ago - Pushed: about 1 year ago - Stars: 686 - Forks: 232
daiquocnguyen/Graph-Transformer
Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (Pytorch and Tensorflow)
Language: Python - Size: 109 MB - Last synced: 4 days ago - Pushed: almost 2 years ago - Stars: 622 - Forks: 77
FreedomIntelligence/TextClassificationBenchmark
A Benchmark of Text Classification in PyTorch
Language: Python - Size: 1.76 MB - Last synced: 22 days ago - Pushed: 30 days ago - Stars: 592 - Forks: 137
jiegzhan/multi-class-text-classification-cnn-rnn
Classify Kaggle San Francisco Crime Description into 39 classes. Build the model with CNN, RNN (GRU and LSTM) and Word Embeddings on Tensorflow.
Language: Python - Size: 88.6 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 592 - Forks: 263
brightmart/sentiment_analysis_fine_grain
Multi-label Classification with BERT; Fine Grained Sentiment Analysis from AI challenger
Language: Jupyter Notebook - Size: 3.39 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 586 - Forks: 162
nuclia/nucliadb
NucliaDB, The AI Search database for RAG
Language: Python - Size: 34 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 569 - Forks: 45
hellonlp/classifier-multi-label
Multi-label text classification, multi-label classification, text classification, multi-label, classifier, text classification, BERT, seq2seq, attention, multi-label-classification
Language: Python - Size: 3.48 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 569 - Forks: 132
NirantK/NLP_Quickbook
NLP in Python with Deep Learning
Language: Jupyter Notebook - Size: 3.15 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 560 - Forks: 231
xuyige/BERT4doc-Classification
Code and source for the paper "How to Fine-Tune BERT for Text Classification?"
Language: Python - Size: 795 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 556 - Forks: 92
stepthom/text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
Size: 707 KB - Last synced: 1 day ago - Pushed: over 1 year ago - Stars: 553 - Forks: 200
VinAIResearch/BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
Language: Python - Size: 134 KB - Last synced: 3 months ago - Pushed: 5 months ago - Stars: 548 - Forks: 51
JayYip/m3tl
BERT for Multitask Learning
Language: Jupyter Notebook - Size: 29.1 MB - Last synced: 4 days ago - Pushed: about 1 year ago - Stars: 544 - Forks: 126
RandolphVI/Multi-Label-Text-Classification
About Multi-Label Text Classification Based on Neural Networks.
Language: Python - Size: 451 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 543 - Forks: 147
shawroad/NLP_pytorch_project
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
Language: Python - Size: 164 MB - Last synced: 6 months ago - Pushed: 11 months ago - Stars: 536 - Forks: 113
murray-z/text_analysis_tools
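Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text error correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction)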
Language: Python - Size: 9.98 MB - Last synced: 6 months ago - Pushed: 8 months ago - Stars: 533 - Forks: 114
webis-de/small-text
Active Learning for Text Classification in Python
Language: Python - Size: 2 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 521 - Forks: 57
moon-hotel/BertWithPretrained
An implementation of the BERT model and its related downstream tasks based on the PyTorch framework
Language: Python - Size: 69.6 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 519 - Forks: 103
PracticingMan/chinese_text_cnn
TextCNN PyTorch implementation for Chinese text classification and sentiment analysis
Language: Python - Size: 4.12 MB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 510 - Forks: 108
dongjun-Lee/text-classification-models-tf
Tensorflow implementations of Text Classification Models.
Language: Python - Size: 10.7 KB - Last synced: 22 days ago - Pushed: over 1 year ago - Stars: 505 - Forks: 167
gaoisbest/NLP-Projects
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Language: OpenEdge ABL - Size: 384 MB - Last synced: 14 days ago - Pushed: over 3 years ago - Stars: 503 - Forks: 149
EricFillion/happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
Language: Python - Size: 18.8 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 496 - Forks: 63
FXLP/MarkTool
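DoTAT is a web-based, domain-oriented, general-purpose text annotation tool. It supports large-scale entity annotation, relation annotation, event annotation, text classification, automatic annotation based on dictionary and regular-expression matching, and standard-name annotation for normalization, as well as iterative annotation, nested entity annotation, and nested event annotation. Annotation specifications are customizable and can be "created once and reused many times" across tasks of the same type. Hierarchical entity sets expand the range of entity types, and a new, efficient annotation method improves user experience and annotation efficiency. The tool also adds a review stage in which annotations from multiple annotators can be checked for consistency, merged automatically, and adjusted manually, improving the accuracy of the results.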
Language: Vue - Size: 854 KB - Last synced: 6 months ago - Pushed: almost 2 years ago - Stars: 480 - Forks: 83
jind11/TextFooler
A Model for Natural Language Attack on Text Classification and Inference
Language: Python - Size: 2.77 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 470 - Forks: 78
eBay/Sequence-Semantic-Embedding 📦
Tools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Language: Python - Size: 180 MB - Last synced: 22 days ago - Pushed: over 5 years ago - Stars: 459 - Forks: 122
yuanxiaosc/BERT-for-Sequence-Labeling-and-Text-Classification
This is the template code to use BERT for sequence labeling and text classification, in order to facilitate BERT for more tasks. Currently, the template code includes CoNLL-2003 named entity identification, Snips Slot Filling and Intent Prediction.
Language: Python - Size: 2.75 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 456 - Forks: 96
nishitpatel01/Fake_News_Detection
Fake News Detection in Python
Language: Jupyter Notebook - Size: 12.6 MB - Last synced: 6 months ago - Pushed: 9 months ago - Stars: 453 - Forks: 307
sourcedexter/tfClassifier
Tensorflow based training and classification scripts for text, images, etc
Language: Python - Size: 82.9 MB - Last synced: 3 months ago - Pushed: over 6 years ago - Stars: 452 - Forks: 277
jasoncao11/nlp-notebook
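Implementations of common NLP tasks, including new word discovery and PyTorch-based word vectors, Chinese text classification, entity recognition, summary generation, sentence similarity, triple extraction, pre-trained models, and more.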
Language: Python - Size: 53 MB - Last synced: 6 months ago - Pushed: almost 1 year ago - Stars: 440 - Forks: 100
shibing624/pytextclassifier
pytextclassifier is a toolkit for text classification. Implementations of classification models such as LR, Xgboost, TextCNN, FastText, TextRNN, and BERT, ready to use out of the box.
Language: Python - Size: 17.4 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 432 - Forks: 71
cjymz886/text-cnn
CNN Chinese text classification with embedded Word2vec word vectors
Language: Python - Size: 22.6 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 430 - Forks: 117
jiegzhan/multi-class-text-classification-cnn
Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN (Convolutional Neural Network) and Word Embeddings on Tensorflow.
Language: Python - Size: 296 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 427 - Forks: 200
raghakot/keras-text
Text Classification Library in Keras
Language: Python - Size: 11.6 MB - Last synced: about 23 hours ago - Pushed: almost 6 years ago - Stars: 420 - Forks: 98
The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
Language: Jupyter Notebook - Size: 50.2 MB - Last synced: about 19 hours ago - Pushed: 7 days ago - Stars: 414 - Forks: 43
kk7nc/RMDL
RMDL: Random Multimodel Deep Learning for Classification
Language: Python - Size: 223 MB - Last synced: 21 days ago - Pushed: about 1 year ago - Stars: 413 - Forks: 124
yongyehuang/zhihu-text-classification
[2017 Zhihu Kanshan Cup, multi-label text classification] Solution by team ye (6th place)
Language: Jupyter Notebook - Size: 20.7 MB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 405 - Forks: 158
interpretml/interpret-text
A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.
Language: Python - Size: 10.3 MB - Last synced: 5 days ago - Pushed: 3 months ago - Stars: 402 - Forks: 67
airbnb/artificial-adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Language: Python - Size: 116 KB - Last synced: 8 days ago - Pushed: over 2 years ago - Stars: 391 - Forks: 57
Clarifai/clarifai-python
Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!
Language: Python - Size: 7.69 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 389 - Forks: 118
kermitt2/delft
a Deep Learning Framework for Text
Language: Python - Size: 832 MB - Last synced: 3 days ago - Pushed: 18 days ago - Stars: 385 - Forks: 65
uvipen/Hierarchical-attention-networks-pytorch
Hierarchical Attention Networks for document classification
Language: Python - Size: 48.5 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 365 - Forks: 101
hellonlp/sentiment-analysis
Sentiment analysis, text classification, lexicon, bayes, sentiment analysis, TextCNN, classification, tensorflow, BERT, CNN, text classification
Language: Python - Size: 9.11 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 347 - Forks: 62
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Language: Python - Size: 272 MB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 345 - Forks: 132
sergioburdisso/pyss3
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI)
Language: Python - Size: 102 MB - Last synced: 8 days ago - Pushed: 9 months ago - Stars: 332 - Forks: 44
yongzhuo/Macadam
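Macadam is a natural language processing toolkit built on Tensorflow (Keras) and bert4keras, focused on text classification, sequence labeling, and relation extraction. It supports embeddings such as RANDOM, WORD2VEC, FASTTEXT, BERT, ALBERT, ROBERTA, NEZHA, XLNET, ELECTRA, and GPT-2; text classification algorithms such as FineTune, FastText, TextCNN, CharCNN, BiRNN, RCNN, DCNN, CRNN, DeepMoji, SelfAttention, HAN, and Capsule; and sequence labeling algorithms such as CRF, Bi-LSTM-CRF, CNN-LSTM, DGCNN, Bi-LSTM-LAN, Lattice-LSTM-Batch, and MRC.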
Language: Python - Size: 975 KB - Last synced: 28 days ago - Pushed: about 1 year ago - Stars: 324 - Forks: 38
phurwicz/hover
🚤 Label data at scale. Fun and precision included.
Language: Python - Size: 192 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 316 - Forks: 18
zhanlaoban/Transformers_for_Text_Classification
Text classification based on Transformers
Language: Python - Size: 39.9 MB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 315 - Forks: 65
shibing624/nlp-tutorial
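Natural language processing (NLP) tutorial covering word vectors, lexical analysis, pre-trained language models, text classification, text semantic matching, information extraction, translation, and dialogue.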
Language: Jupyter Notebook - Size: 2.69 MB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 296 - Forks: 56
RandolphVI/Hierarchical-Multi-Label-Text-Classification
The code for the CIKM'19 paper "Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach"
Language: Python - Size: 300 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 295 - Forks: 68