An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: subwords

marlonrichert/zsh-edit

🛠 Better command line editing tools for Zsh

Language: Shell - Size: 114 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 112 - Forks: 7

svirpioj/morphoeval

Evaluation for unsupervised morphological analysis and segmentation

Language: Python - Size: 41 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

vishnukanduri/Language-Classification-using-Naive-Bayes-in-Python

Classified sentences as Slovak, Czech, or English. Implemented the relevant preprocessing steps, addressed class imbalance in the training set by applying the theory of Naive Bayes models, and implemented subword units.

Language: Smalltalk - Size: 1.08 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

butsugiri/gec-pseudodata

Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)

Language: Python - Size: 707 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 67 - Forks: 8

TiMauzi/dawg

The concept of DAWGs is based on: Blumer, A. et al. (1985). The smallest automaton recognizing the subwords of a text. Theoretical Computer Science, 40, 31–55.

Language: Java - Size: 88.9 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

prabormukherjee/Language_classifier

Classifying English, Slovak, and Czech text using Naive Bayes

Language: Smalltalk - Size: 1.06 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0