Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
Package Usage: pypi: utoken
utoken is a universal tokenizer (multilingual word segmenter) that divides text into words, punctuation and special tokens such as numbers, URLs, XML tags, email-addresses and hashtags. It comes with a companion detokenizer.
6 versions
Latest release: over 2 years ago
121 downloads last month
View more package details: https://packages.ecosyste.ms/registries/pypi.org/packages/utoken
View more repository details: https://repos.ecosyste.ms/hosts/GitHub/repositories/uhermjakob%2Futoken
Dependent Repos 1
isi-nlp/rtg
Reader Translator Generator - NMT toolkit based on pytorch- * setup.py
Size: 6.23 MB - Last synced: about 1 month ago - Pushed: 9 months ago