An open API service providing repository metadata for many open source software ecosystems.

Topic: "word-segmenter"

undertheseanlp/underthesea

Underthesea - Vietnamese NLP Toolkit

Language: Python - Size: 166 MB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 1,518 - Forks: 281

vncorenlp/VnCoreNLP

A Vietnamese natural language processing toolkit (NAACL 2018)

Language: Java - Size: 232 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 570 - Forks: 141

fastcws/fastcws

轻量级高性能中文分词项目

Language: C++ - Size: 524 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 190 - Forks: 8

VietHoang1512/khmer-nltk

Khmer language processing toolkit

Language: Python - Size: 10 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 71 - Forks: 18

ndthuan/vi-word-segmenter

HTTP wrapper of the VnCoreNLP library - A Vietnamese natural language processing toolkit

Language: Java - Size: 82 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ndthuan/go-vi-wordseg-client

Go client library for the ndthuan/vi-word-segmenter service

Language: Go - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

midobal/tokenizer

A tokenization tool for tokenizing sentences using several tokenizers.

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0