Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: compound-words

muhammadshaffay/Roman-Urdu-Tokenizer

Enhance Roman-Urdu text processing with this Python-based tokenizer that handles compound words flawlessly.

Language: Jupyter Notebook - Size: 65.4 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

shangjingbo1226/AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language: C++ - Size: 195 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 1,161 - Forks: 272

viig99/SymSpellCppPy

Fast SymSpell written in c++ and exposes to python via pybind11

Language: C++ - Size: 8.31 MB - Last synced: 17 days ago - Pushed: about 1 year ago - Stars: 38 - Forks: 7

GokulVSD/FOGIndex

Provides functions required for calculation of Gunning / regular FOG index. Contains a syllable counter and a dictionary based compound word splitter.

Language: Python - Size: 5.86 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1

SoumadipDey/Longest-Compound-Words

Impledge Technologies Interview Coding Test 2022

Language: Python - Size: 1.36 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 1