GitHub topics: chinese-nlp
crownpku/Awesome-Chinese-NLP
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
Size: 317 KB - Last synced at: 13 minutes ago - Pushed at: over 1 year ago - Stars: 7,864 - Forks: 1,717

hiDaDeng/hidadeng.github.io
大邓的个人博客,博客域名在下方, 访问可能有点慢啊。
Language: HTML - Size: 1.61 GB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 4 - Forks: 2

pwxcoo/chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
Language: Python - Size: 34.6 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 11,163 - Forks: 2,616

HIT-SCIR/ltp
Language Technology Platform
Language: Python - Size: 15.6 MB - Last synced at: 3 days ago - Pushed at: 23 days ago - Stars: 5,089 - Forks: 1,051

LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
Language: HTML - Size: 18 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 8,130 - Forks: 769

lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language: Jupyter Notebook - Size: 3.22 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 5,757 - Forks: 457

crazywhalecc/idiom-database
成语数据库,成语接龙数据库,拥有30000+个成语,可直接使用首拼音和尾拼音编写自己的成语接龙
Size: 12.2 MB - Last synced at: about 6 hours ago - Pushed at: about 4 years ago - Stars: 93 - Forks: 22

iflytek/cino
CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)
Language: Python - Size: 21.7 MB - Last synced at: about 1 hour ago - Pushed at: about 2 years ago - Stars: 242 - Forks: 30

ECNU-ICALK/EduChat
An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM
Language: Jupyter Notebook - Size: 210 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 785 - Forks: 86

hiDaDeng/hidadeng
github介绍页
Size: 34.2 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

thunlp/THULAC-Python
An Efficient Lexical Analyzer for Chinese
Language: Python - Size: 78.1 KB - Last synced at: 10 days ago - Pushed at: about 3 years ago - Stars: 2,059 - Forks: 335

rime/rime-cantonese
Rime Cantonese input schema | 粵語拼音輸入方案
Language: Python - Size: 88.5 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 580 - Forks: 64

CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
Language: Python - Size: 7.27 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 3,049 - Forks: 234

ydli-ai/CSL
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集
Language: Python - Size: 3.97 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 623 - Forks: 59

esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Size: 681 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 3,812 - Forks: 265

HIT-SCIR/pyltp Fork of HuangFJ/pyltp
pyltp: the python extension for LTP
Language: C++ - Size: 8.76 MB - Last synced at: 10 days ago - Pushed at: almost 3 years ago - Stars: 1,544 - Forks: 350

crownpku/Information-Extraction-Chinese
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取
Language: Python - Size: 78.9 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 2,255 - Forks: 808

fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Language: Python - Size: 35.1 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 3,126 - Forks: 450

IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Language: Python - Size: 84.5 MB - Last synced at: 15 days ago - Pushed at: 8 months ago - Stars: 4,106 - Forks: 380

baidu/lac
百度NLP:分词,词性标注,命名实体识别,词重要性
Language: C++ - Size: 63.6 MB - Last synced at: 14 days ago - Pushed at: almost 4 years ago - Stars: 3,921 - Forks: 596

thunlp/THULAC
An Efficient Lexical Analyzer for Chinese
Language: C++ - Size: 93.8 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 806 - Forks: 173

modelscope/AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
Language: Python - Size: 5.03 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 434 - Forks: 41

baidu/DDParser
百度开源的依存句法分析系统
Language: Python - Size: 354 KB - Last synced at: 17 days ago - Pushed at: about 2 years ago - Stars: 986 - Forks: 162

Doragd/Chinese-Chatbot-PyTorch-Implementation
:four_leaf_clover: Another Chinese chatbot implemented in PyTorch, which is the sub-module of intelligent work order processing robot. 👩🔧
Language: Python - Size: 81.6 MB - Last synced at: 16 days ago - Pushed at: 9 months ago - Stars: 890 - Forks: 194

rhzxg/MicroblogCrawler
微博热榜爬虫
Language: Python - Size: 234 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

zake7749/Kyara
Lightweight and Effective Chinese LLM.
Language: Jupyter Notebook - Size: 255 KB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 1

taishan1994/pytorch_bert_event_extraction
基于pytorch+bert的中文事件抽取
Language: Python - Size: 5.81 MB - Last synced at: 19 days ago - Pushed at: almost 3 years ago - Stars: 72 - Forks: 4

didi/ChineseNLP
Datasets, SOTA results of every fields of Chinese NLP
Language: HTML - Size: 875 KB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 1,804 - Forks: 271

linonetwo/segmentit
任何 JS 环境可用的中文分词包,fork from leizongmin/node-segment
Language: JavaScript - Size: 3.18 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 283 - Forks: 16

FerdinandZhong/punctuator
A small seq2seq punctuator tool based on DistilBERT
Language: Python - Size: 119 MB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 51 - Forks: 8

hscspring/pnlp
NLP预/后处理工具。
Language: Python - Size: 106 KB - Last synced at: 16 days ago - Pushed at: 25 days ago - Stars: 29 - Forks: 6

howl-anderson/MicroTokenizer
一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a practical, hands-on approach to understanding NLP concepts, featuring multiple tokenization algorithms and customizable models. Ideal for students, researchers, and NLP enthusiasts..
Language: Python - Size: 174 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 150 - Forks: 22

Isaac-JL-Chen/rouge_chinese Fork of pltrdy/rouge
Python ROUGE Score Implementation for Chinese Language Task (official rouge score)
Language: Python - Size: 90.8 KB - Last synced at: 16 days ago - Pushed at: 10 months ago - Stars: 101 - Forks: 5

lionsoul2014/jcseg
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for lucene,solr,elasticsearch,opensearch
Language: Java - Size: 21.1 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 923 - Forks: 212

brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Size: 3.91 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 9,635 - Forks: 1,558

OYE93/Chinese-NLP-Corpus
Collections of Chinese NLP corpus
Language: Python - Size: 7.14 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 897 - Forks: 210

Kyubyong/g2pC
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
Language: Python - Size: 21.8 MB - Last synced at: 18 days ago - Pushed at: almost 6 years ago - Stars: 240 - Forks: 31

LindiaC/ChatGLM2-With-Rua-Tutorial
无需预算,使用你的个人数据克隆自己——赛博飞升!Clone yourself by tuning a LLM using your own data.
Language: Jupyter Notebook - Size: 113 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 34 - Forks: 0

howl-anderson/Chinese_models_for_SpaCy
SpaCy 中文模型 | Models for SpaCy that support Chinese
Language: Jupyter Notebook - Size: 709 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 661 - Forks: 111

boat-group/fancy-nlp
NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
Language: Python - Size: 769 KB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 284 - Forks: 40

bububa/jiagu
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
Language: Go - Size: 88.4 MB - Last synced at: 10 days ago - Pushed at: almost 4 years ago - Stars: 21 - Forks: 6

cingtiye/Awesome-Open-domain-Dialogue-Models
Awesome Open-domain Dialogue Models,高质量开放域对话模型集合
Size: 25.4 KB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 36 - Forks: 2

thunlp/THULAC-Java
An Efficient Lexical Analyzer for Chinese
Language: Java - Size: 332 KB - Last synced at: 18 days ago - Pushed at: over 7 years ago - Stars: 332 - Forks: 111

thunlp/THUCKE
THU Chinese Keyphrase Extraction Toolkit
Language: C++ - Size: 44.9 KB - Last synced at: 21 days ago - Pushed at: about 7 years ago - Stars: 125 - Forks: 19

thunlp/THULAC.so
An Efficient Lexical Analyzer for Chinese
Language: C++ - Size: 47.9 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 42 - Forks: 20

thunlp/THUCTC
An Efficient Chinese Text Classifier
Language: Java - Size: 1.67 MB - Last synced at: 8 days ago - Pushed at: over 6 years ago - Stars: 206 - Forks: 67

stephenlzc/AI-Agent-Debate_Autogen_Turtorial
🤖 基于AutoGen的AI辩论系统 | 🗣️ 支持中文交互 | 🔄 多智能体协作 | 📝 自动记录辩论过程 🤖 AI Debate System based on AutoGen | 🗣️ Chinese Interaction | 🔄 Multi-Agent Collaboration | 📝 Auto Debate Recording
Language: Python - Size: 5.86 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

howl-anderson/WeatherBot
一个基于 Rasa 的中文天气情况问询机器人(chatbot), 带 Web UI 界面
Size: 97.6 MB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 237 - Forks: 68

Nannan-Liu/Multidimensional-Analysis-Tagger-of-Mandarin-Chinese
An open-source library in Python for analysing Chinese registers
Language: Python - Size: 10 MB - Last synced at: 12 days ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

hailiang-wang/hanlp-api Fork of beyai/node-hanlp
中文分词,命名实体识别,关键词提取,自动摘要,短语提取,拼音转换,简繁转换,文本推荐,依存句法分析
Language: JavaScript - Size: 1.27 MB - Last synced at: 3 days ago - Pushed at: almost 8 years ago - Stars: 42 - Forks: 14

amutu/zhparser
zhparser is a PostgreSQL extension for full-text search of Chinese language
Language: C - Size: 5.75 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 723 - Forks: 86

BrikerMan/classic_chinese_punctuate
classic Chinese punctuate experiment with keras using daizhige(殆知阁古代文献藏书) dataset
Language: Jupyter Notebook - Size: 175 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 34 - Forks: 2

crownpku/Small-Chinese-Corpus
Some useful Chinese corpus datasets 中文语料小数据
Size: 92.4 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 529 - Forks: 162

Hoiy/berserker
Berserker - BERt chineSE woRd toKenizER
Language: Python - Size: 174 KB - Last synced at: 23 days ago - Pushed at: about 6 years ago - Stars: 16 - Forks: 1

benywon/ChineseBert
This is a chinese Bert model specific for question answering
Language: Python - Size: 1.2 MB - Last synced at: 23 days ago - Pushed at: over 5 years ago - Stars: 26 - Forks: 8

ChanMeng666/customer-insight
【Star us if you're awesome!⭐️】A comprehensive customer review analysis system that provides deep insights through sentiment analysis, keyword extraction, topic modeling, and interactive visualizations. Built with Python and Streamlit, optimized for Chinese text with English language support.
Language: Python - Size: 290 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

Abbey4799/CuteGPT
An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.
Language: Python - Size: 276 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 3

chatopera/chop
Chinese Tokenizer module for Python
Language: Python - Size: 9.32 MB - Last synced at: 19 days ago - Pushed at: almost 7 years ago - Stars: 15 - Forks: 7

DreamerGPT/DreamerGPT
🌱 梦想家(DreamerGPT):中文大语言模型指令精调
Language: Python - Size: 8.93 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 50 - Forks: 2

syltruong/explorehsk
Explore Chinese words and reinforce your HSK vocabulary
Language: Python - Size: 3.2 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

limccn/cacl2
Lexicon for Chinese lexical analyzing, 中文语言分词词库
Language: Python - Size: 291 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 117 - Forks: 22

imsanjoykb/Saici.ai-NLP
Saici.ai NLP Assessment
Language: Jupyter Notebook - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

mbyd916/doc-feature
基于企鹅号自媒体发布的 1000 多万篇文章语料,训练 word2vec 模型; 基于 Flask 框架提供一个简单的抽取文档向量的服务
Language: Python - Size: 131 KB - Last synced at: 8 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 6

celtics1863/envtext
中文环境领域文本分析包,纯神经网络架构,支持EnvBert,LSTM,RNN,word2vec等模型,支持自定义模型,下游任务包括分类,回归,多选,情感分析,命名实体识别等,专题包括气候变化文本分析,环境知识图谱等。针对领域研究进行了接口的优化,一键使用模型。
Language: Python - Size: 408 MB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 5

old-wang-95/easy-bert
easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习
Language: Python - Size: 9.05 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 68 - Forks: 12

yuanhao-chen-nyoeghau/yitizi-rs
Get all variants (yitizi, 異體字) of a Chinese character (Sinograph)!
Language: Rust - Size: 94.7 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

tim5go/zhopenie
Chinese Open Information Extraction (Tree-based Triple Relation Extraction Module)
Language: Python - Size: 89.8 KB - Last synced at: 10 months ago - Pushed at: almost 8 years ago - Stars: 119 - Forks: 26

niuwz/Mini-Chinese-Phi3
基于Phi3模型结构,使用常见的中文预料从零训练的小参数量LLM。包括了tokenizer训练、模型预训练、指令微调和直接偏好优化等流程。
Language: Python - Size: 45.9 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

YuzukiTsuru/AutoPoem
🎊 Automatically write Chinese ancient poems | 自动写古诗
Language: Python - Size: 5.21 MB - Last synced at: 16 days ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 2

nonamestreet/weixin_public_corpus
微信公众号语料库
Size: 1.37 GB - Last synced at: 9 months ago - Pushed at: over 6 years ago - Stars: 568 - Forks: 165

Linusp/zhtools 📦
Tools for Chinese language processing.
Language: Python - Size: 21.5 KB - Last synced at: 7 days ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 1

SUFE-AIFLM-Lab/StatChat
StatChat是一个专门用于统计学及相关应用领域(金融学、经济学、商业分析、数据科学等)知识问答的数字化智能学习助手
Size: 1.78 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

weather-bot/chrono 📦
Javascript 時間自然語言模組 (fork 中文強化版)
Language: JavaScript - Size: 8.41 MB - Last synced at: 8 days ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 2

colibrisson/numerica_sinologica_siku_htr_models 📦
Numerica Sinologica Siku HTR models
Language: Python - Size: 35.3 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

ww-rm/weibo-rmdt
Detect rumors on Weibo by PyTorch.
Language: Python - Size: 640 MB - Last synced at: 11 months ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 2

zake7749/Gossiping-Chinese-Corpus
PTT 八卦版問答中文語料
Language: Jupyter Notebook - Size: 116 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 231 - Forks: 36

chen0040/keras-chinese-resume-parser-and-analyzer
keras project that parses and analyze chinese resumes
Language: Python - Size: 4.39 MB - Last synced at: 21 days ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 13

aplmikex/deduplication_mnbvc
文本去重
Language: Python - Size: 104 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 57 - Forks: 6

voidism/Chinese_Sentence_Dependency_Analyzer
Using Word2vec's center vector and context vector to analysis the collocation relations between Chinese words, and greedily want to extract some dependency relations in sentence (but not so successful).
Language: Python - Size: 5.64 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

dongrixinyu/jiojio
A convenient Chinese word segmentation tool 简便中文分词器
Language: Python - Size: 507 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 34 - Forks: 5

yaoxiaoyuan/mimix
Mimix: A Text Generation Tool and Pretrained Chinese Models
Language: Python - Size: 6.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 144 - Forks: 16

SleepingMonster/Keras_BiLSTM-CRF_Chinese_Sequence_Annotation
中山大学自然语言处理项目:中文分词(序列标注/命名实体识别)。Keras实现,BiLSTM+CRF框架。
Language: Jupyter Notebook - Size: 15 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 4

open-chinese/chinese-word-structure
研究所有汉字的结构,为NLP中汉字结构问题提供完备的解。
Size: 202 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 2

richard-peng-xia/KD-CGEC
Code for Chinese grammatical error correction based on knowledge distillation
Language: Python - Size: 29 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

duduscript/split
中文分词程序
Language: Python - Size: 71 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 1

JherezTaylor/f360-textmining-test
Python code for text mining test
Language: Jupyter Notebook - Size: 9.88 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

tim5go/cnn-question-classification-keras
Chinese Question Classifier (Keras Implementation) on BQuLD
Language: Python - Size: 693 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 14

wut0n9/cnn_chinese_text_classification
运用cnn + highway network网络结构中文文本分类
Language: Python - Size: 2.07 MB - Last synced at: 8 days ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 1

HIT-SCIR/ltp4j Fork of ruoshui1126/ltp4j
ltp4j: Language Technology Platform For Java
Language: C++ - Size: 12.7 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 162 - Forks: 82

richard-peng-xia/Chinese-Noisy-Text
This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and characters in redundant, missing, selection and ordering respectively.
Language: Python - Size: 68.4 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 3

thinkwee/eda_zh_bert
Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"
Language: Python - Size: 7.81 KB - Last synced at: 22 days ago - Pushed at: almost 6 years ago - Stars: 11 - Forks: 1

Aguila-team/Chinese_NLU_by_using_RASA_NLU
使用 RASA NLU 来构建中文自然语言理解系统(NLU)| Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Language: Python - Size: 52.7 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 125 - Forks: 32

ericchw/Youth_Discord_NLP_Chatbot
A python AI chatbot with emotion detection model. Frontend using PHP, API using Flask and database using PostgreSQL. Collaborate with CyberYouth from SJS. @HKMU 2022-2023 FYP
Language: CSS - Size: 24.6 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

esun-ai/phonetic_mlm
Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition
Language: Python - Size: 24.3 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 5

kevinhu/hotpot
A lightweight Chinese-English dictionary
Language: JavaScript - Size: 119 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

TheOne1006/m3e-server
m3e api
Language: Python - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VilTea/2-gram
2-gram中文分词
Language: Python - Size: 13.9 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

falcondai/chinese-char-lm
explores Chinese language models with sub-character level visual information
Language: Python - Size: 77.1 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 3

ksOAn6g5/TaiSu
TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)
Language: Python - Size: 3.98 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 129 - Forks: 9

guhhhhaa/4675-scifi
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
Size: 113 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 277 - Forks: 50
