Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: kenlm
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It implements KenLM, T5, MacBERT, ChatGLM3, LLaMA, and other models for error-correction scenarios and works out of the box.
Language: Python - Size: 50.4 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 5,245 - Forks: 1,067
Sarasadeghii/Sharif-Wav2vec2
This repo shows how to fine-tune the wav2vec 2.0 model, along with its prerequisites.
Language: Jupyter Notebook - Size: 294 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 3 - Forks: 0
kmario23/KenLM-training
Training an n-gram language model with the KenLM toolkit for Deep Speech 2
Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 110 - Forks: 21
DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C - Size: 289 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 408 - Forks: 18
LuluW8071/Automatic-Speech-Recognition-with-PyTorch
End-to-End Automatic Speech Recognition on PyTorch with CTC Decoder and KenLM
Language: Python - Size: 137 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 1
Lednik7/nto-ai-text-recognition
Optical Character Recognition + Instance Segmentation for Russian and English
Language: Jupyter Notebook - Size: 47.7 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 27 - Forks: 2
pooya-mohammadi/persian-spell-checker-kenlm
Complete instructions for training a Persian spell checker based on SymSpell and a language model based on KenLM, using a Wikipedia dataset.
Language: Python - Size: 438 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 28 - Forks: 2
fquirin/kaldi-adapt-lm (fork of gooofy/kaldi-adapt-lm)
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Language: Python - Size: 98.6 KB - Last synced: 23 days ago - Pushed: 12 months ago - Stars: 6 - Forks: 2
Leen-Alzebdeh/NLP-LMs
We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.
Language: Python - Size: 1.94 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
SNUDerek/lm_perplexity_bootstrapping
demo of domain corpus bootstrapping using language model perplexity
Language: Jupyter Notebook - Size: 61.5 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 3
Targoman/TargomanSMT
Targoman SMT framework source code
Language: C++ - Size: 3.06 MB - Last synced: about 1 month ago - Pushed: over 6 years ago - Stars: 30 - Forks: 6
Msparihar/Transcriber
An AI tool that automatically generates captions and transcripts for YouTube videos in 67 languages and summarized texts in 133 languages.
Language: Python - Size: 14.6 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 3 - Forks: 1
Sundy1219/ctc_beam_search_lm
CTC+Beam_Search+kenlm is a decoding system for acoustic models that use Chinese characters as the modeling unit
Language: C++ - Size: 58.2 MB - Last synced: 7 months ago - Pushed: almost 6 years ago - Stars: 43 - Forks: 22
DeepSchneider/speech-recognition-examples
A simple guide to building your own end-to-end automatic speech recognition system in PyTorch
Language: Python - Size: 82.9 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 1
DeepSchneider/iam-crnn-ctc-recognition
IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer
Language: Python - Size: 1.71 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 1
racai-ai/RobinASR
Romanian Automatic Speech Recognition from the ROBIN project
Language: Python - Size: 204 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 17 - Forks: 8
loretoparisi/wave2vec-recognize-docker
wav2vec 2.0 recognition pipeline
Language: Python - Size: 33.2 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 33 - Forks: 10
teodor-cotet/RoGEC
Neural Grammatical Error Correction for Romanian using Transformer
Language: Python - Size: 41.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 1
levyfan/kenlm-jni
A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries
Language: Java - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 12 - Forks: 4
tokestermw/spacy_kenlm
KenLM extension for spaCy 2.0.
Language: Python - Size: 8.79 KB - Last synced: 26 days ago - Pushed: over 6 years ago - Stars: 16 - Forks: 2
mozilla/scorertool
Generate language models from OSCAR corpora
Language: Python - Size: 58.6 KB - Last synced: 2 days ago - Pushed: about 4 years ago - Stars: 7 - Forks: 1