An open API service providing repository metadata for many open source software ecosystems.

Topic: "sentence-segmenter"

segment-any-text/wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Language: Python - Size: 83 MB - Last synced at: 9 days ago - Pushed at: 28 days ago - Stars: 987 - Forks: 56

notAI-tech/deepsegment 📦

A sentence segmenter that actually works!

Language: Python - Size: 81.1 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 305 - Forks: 56

VietHoang1512/khmer-nltk

Khmer language processing toolkit

Language: Python - Size: 10 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 71 - Forks: 18

DoodleBears/split-lang

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

Language: Jupyter Notebook - Size: 295 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 51 - Forks: 4

sentencizer/sentencizer

A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.

Language: Go - Size: 1.83 MB - Last synced at: 8 days ago - Pushed at: 24 days ago - Stars: 31 - Forks: 6

superlinear-ai/wtpsplit-lite

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

Language: Python - Size: 128 KB - Last synced at: 14 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 0

akash-rajak/Sentence-Segmenter

Python script to segment the huge paragraph or text in different lines and segments.

Language: Python - Size: 983 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0