Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: tinysegmenter

Funkschy/TinySegmenter

A Clojure library to split Japanese into words

Language: Clojure - Size: 12.7 KB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

JuliaStrings/TinySegmenter.jl

Julia version of TinySegmenter, compact Japanese tokenizer

Language: Julia - Size: 286 KB - Last synced: 13 days ago - Pushed: over 3 years ago - Stars: 20 - Forks: 8

Ran350/ja-wordcloud

Serverless web app for ultra-fast generation of highly customizable WordCloud for Japanese

Language: TypeScript - Size: 15.5 MB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0

ryoppippi/bunsetsu.vim

Language: TypeScript - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0

jestasgameland/Japanese_tokenizer

A small experiment using both Mecab and Tinysegmenter to create a tokenized list of Japanese sentences in JSON, taken from the Tatoeba corpus.

Language: Python - Size: 5.48 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0