Topic: "word-segmenter"
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
Language: Python - Size: 166 MB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 1,518 - Forks: 281

vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
Language: Java - Size: 232 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 570 - Forks: 141

fastcws/fastcws
轻量级高性能中文分词项目
Language: C++ - Size: 524 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 190 - Forks: 8

VietHoang1512/khmer-nltk
Khmer language processing toolkit
Language: Python - Size: 10 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 71 - Forks: 18

ndthuan/vi-word-segmenter
HTTP wrapper of the VnCoreNLP library - A Vietnamese natural language processing toolkit
Language: Java - Size: 82 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ndthuan/go-vi-wordseg-client
Go client library for the ndthuan/vi-word-segmenter service
Language: Go - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

midobal/tokenizer
A tokenization tool for tokenizing sentences using several tokenizers.
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0
