An open API service providing repository metadata for many open source software ecosystems.

Topic: "wordsegmentation"

fastcws/fastcws

A lightweight, high-performance Chinese word segmentation project

Language: C++ - Size: 524 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 190 - Forks: 8

dongrixinyu/jiojio

A convenient Chinese word segmentation tool

Language: Python - Size: 507 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 34 - Forks: 5

mrpeerat/OSKut

Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL 2021 Findings).

Language: Jupyter Notebook - Size: 122 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 30 - Forks: 6

yxuansu/Chinese-TaCL-BERT-NER-CWS

Chinese named entity recognition and Chinese word segmentation based on Chinese TaCL-BERT

Language: Python - Size: 122 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 3

mrpeerat/SEFR_CUT

Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP 2020)

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 1

DavionWu2018/Word_frequency

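[Data + code] Word segmentation of listed companies' annual report text, keyword frequency statistics, and a digital transformation keyword list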

Language: Jupyter Notebook - Size: 235 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 2

semihsevinc/Noteedom

Handwriting Text Recognition

Language: Python - Size: 29.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

subtosilencio/unigrams_pt-br

Word segmentation to create unigrams in Portuguese (pt-br)

Language: Python - Size: 675 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0