An open API service providing repository metadata for many open source software ecosystems.

Topic: "kenlm"

shibing624/pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

Language: Python - Size: 50.7 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 5,997 - Forks: 1,139

DeutscheKI/tevr-asr-tool

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Language: C - Size: 289 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 413 - Forks: 18

kmario23/KenLM-training

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 114 - Forks: 21

Sundy1219/ctc_beam_search_lm

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

Language: C++ - Size: 58.2 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 43 - Forks: 22

pooya-mohammadi/persian-spell-checker-kenlm

A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedia dataset.

Language: Python - Size: 438 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 34 - Forks: 1

loretoparisi/wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Language: Python - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 33 - Forks: 10

Targoman/TargomanSMT

Targoman SMT framework source code

Language: C++ - Size: 3.06 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 30 - Forks: 6

Lednik7/nto-ai-text-recognition

Optical Character Recognition + Instance Segmentation for russian and english languages

Language: Jupyter Notebook - Size: 47.7 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 2

racai-ai/RobinASR

Romanian Automatic Speech Recognition from the ROBIN project

Language: Python - Size: 204 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 8

tokestermw/spacy_kenlm

:game_die: KenLM extension for spaCy 2.0.

Language: Python - Size: 8.79 KB - Last synced at: 16 days ago - Pushed at: over 7 years ago - Stars: 16 - Forks: 2

levyfan/kenlm-jni

A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries

Language: Java - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 4

LuluW8071/Automatic-Speech-Recognition-with-PyTorch

Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡

Language: Python - Size: 4.16 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 9 - Forks: 2

mozilla/scorertool 📦

INACTIVE - http://mzl.la/ghe-archive - Generate language models from OSCAR corpora

Language: Python - Size: 58.6 KB - Last synced at: 9 days ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 1

teodor-cotet/RoGEC

Neural Grammatical Error Correction for Romanian using Transformer

Language: Python - Size: 41.1 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

Msparihar/Transcriber

Developed an AI tool to automatically generate captions and transcripts for YouTube videos in 67 languages and can generate summarized texts in 133 languages.

Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

fquirin/kaldi-adapt-lm Fork of gooofy/kaldi-adapt-lm

Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech

Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

LuluW8071/ASR-with-Speech-Sentiment-and-Text-Summarizer

Automatic Speech Recognition using Conformer with Speech Sentiment Analysis & Text Summarizer

Language: Python - Size: 27 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 5 - Forks: 3

Sarasadeghii/Sharif-Wav2vec2

This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.

Language: Jupyter Notebook - Size: 297 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

SNUDerek/lm_perplexity_bootstrapping

demo of domain corpus bootstrapping using language model perplexity

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

BonySmoke/speliuk

A more accurate spelling correction for the Ukrainian language.

Language: Python - Size: 145 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Leen-Alzebdeh/NLP-LMs

We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.

Language: Python - Size: 1.94 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0