Topic: "sentence-segmenter"
segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Language: Python - Size: 83 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 1,036 - Forks: 58

notAI-tech/deepsegment 📦
A sentence segmenter that actually works!
Language: Python - Size: 81.1 KB - Last synced at: 16 days ago - Pushed at: almost 5 years ago - Stars: 306 - Forks: 55

VietHoang1512/khmer-nltk
Khmer language processing toolkit
Language: Python - Size: 10 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 73 - Forks: 18

DoodleBears/split-lang
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux
Language: Jupyter Notebook - Size: 295 KB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 53 - Forks: 5

sentencizer/sentencizer
A sentence-splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out of the box.
Language: Go - Size: 1.83 MB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 33 - Forks: 6

superlinear-ai/wtpsplit-lite
✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models
Language: Python - Size: 130 KB - Last synced at: 4 days ago - Pushed at: 26 days ago - Stars: 11 - Forks: 1

akash-rajak/Sentence-Segmenter
Python script to segment a long paragraph or text into separate lines and segments.
Language: Python - Size: 983 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0
