Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: sentence-splitter

gosbd/gosbd

A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.

Language: Go - Size: 1.82 MB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 7 - Forks: 2

tsproisl/SoMaJo

A tokenizer and sentence splitter for German and English web and social media texts.

Language: Python - Size: 1.34 MB - Last synced: 14 days ago - Pushed: 22 days ago - Stars: 134 - Forks: 20

rakutentech/pisah

Sentence Splitter Library (C++ port of pySBD)

Language: C++ - Size: 132 KB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 4 - Forks: 0

ZJaume/splitters

A CLI for Rust SRX sentence segmenation rules as Python package.

Language: Rust - Size: 68.4 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

busraoguzoglu/Turkish-NLP-Preprocessing-module Fork of meliksahturker/Turkish-NLP-Preprocessing-module

Preprocessing tool for Turkish NLP that contains tokenizer, normalizer, stop-word eliminator and stemmer

Language: Jupyter Notebook - Size: 9.63 MB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

astariul/Sentencize.jl

Smallish library for sentence splitting in Julia

Language: Julia - Size: 43 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 2