Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: kenlm

shibing624/pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Language: Python - Size: 50.4 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 5,245 - Forks: 1,067

Sarasadeghii/Sharif-Wav2vec2

This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.

Language: Jupyter Notebook - Size: 294 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 3 - Forks: 0

kmario23/KenLM-training

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 110 - Forks: 21

DeutscheKI/tevr-asr-tool

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Language: C - Size: 289 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 408 - Forks: 18

LuluW8071/Automatic-Speech-Recognition-with-PyTorch

End-to-End Automatic Speech Recognition on PyTorch with CTC Decoder and Ken LM

Language: Python - Size: 137 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 1

Lednik7/nto-ai-text-recognition

Optical Character Recognition + Instance Segmentation for russian and english languages

Language: Jupyter Notebook - Size: 47.7 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 27 - Forks: 2

pooya-mohammadi/persian-spell-checker-kenlm

A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedia dataset.

Language: Python - Size: 438 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 28 - Forks: 2

fquirin/kaldi-adapt-lm Fork of gooofy/kaldi-adapt-lm

Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech

Language: Python - Size: 98.6 KB - Last synced: 23 days ago - Pushed: 12 months ago - Stars: 6 - Forks: 2

Leen-Alzebdeh/NLP-LMs

We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.

Language: Python - Size: 1.94 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

SNUDerek/lm_perplexity_bootstrapping

demo of domain corpus bootstrapping using language model perplexity

Language: Jupyter Notebook - Size: 61.5 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 2 - Forks: 3

Targoman/TargomanSMT

Targoman SMT framework source code

Language: C++ - Size: 3.06 MB - Last synced: about 1 month ago - Pushed: over 6 years ago - Stars: 30 - Forks: 6

Msparihar/Transcriber

Developed an AI tool to automatically generate captions and transcripts for YouTube videos in 67 languages and can generate summarized texts in 133 languages.

Language: Python - Size: 14.6 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 3 - Forks: 1

Sundy1219/ctc_beam_search_lm

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

Language: C++ - Size: 58.2 MB - Last synced: 7 months ago - Pushed: almost 6 years ago - Stars: 43 - Forks: 22

DeepSchneider/speech-recognition-examples

Simple Guide How To Build Your Own End-To-End Automatic Speech Recognition System Written In PyTorch

Language: Python - Size: 82.9 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 1

DeepSchneider/iam-crnn-ctc-recognition

IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer

Language: Python - Size: 1.71 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 1

racai-ai/RobinASR

Romanian Automatic Speech Recognition from the ROBIN project

Language: Python - Size: 204 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 17 - Forks: 8

loretoparisi/wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Language: Python - Size: 33.2 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 33 - Forks: 10

teodor-cotet/RoGEC

Neural Grammatical Error Correction for Romanian using Transformer

Language: Python - Size: 41.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 7 - Forks: 1

levyfan/kenlm-jni

A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries

Language: Java - Size: 19.5 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 12 - Forks: 4

tokestermw/spacy_kenlm

:game_die: KenLM extension for spaCy 2.0.

Language: Python - Size: 8.79 KB - Last synced: 26 days ago - Pushed: over 6 years ago - Stars: 16 - Forks: 2

mozilla/scorertool

Generate language models from OSCAR corpora

Language: Python - Size: 58.6 KB - Last synced: 2 days ago - Pushed: about 4 years ago - Stars: 7 - Forks: 1