An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: kaldi-asr

linto-ai/linto-stt

An automatic speech recognition API

Language: Python - Size: 579 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 61 - Forks: 16

YoavRamon/awesome-kaldi

This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)

Size: 18.6 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 536 - Forks: 84

garvys-org/rustfst

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Language: Rust - Size: 6.78 MB - Last synced at: 20 days ago - Pushed at: 3 months ago - Stars: 159 - Forks: 17

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Language: Python - Size: 579 KB - Last synced at: about 5 hours ago - Pushed at: almost 2 years ago - Stars: 342 - Forks: 51

skit-ai/kaldi-serve

Server framework for Kaldi ASR Toolkit

Language: C++ - Size: 18.7 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 97 - Forks: 24

mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Language: Perl - Size: 5.56 MB - Last synced at: 26 days ago - Pushed at: over 7 years ago - Stars: 38 - Forks: 13

uday160386/asr_capstone_en_ms

ASR-WebUI : Deploying kaldi Model to Azure

Language: Python - Size: 124 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

grib0ed0v/kaldi-for-russian 📦

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 39 - Forks: 10

SEPIA-Framework/sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Language: Python - Size: 923 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 120 - Forks: 21

daanzu/kaldi_ag_training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Language: Shell - Size: 148 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 4

mycrazycracy/tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language: Python - Size: 398 KB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 16

cadia-lvl/samromur-asr

Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi

Language: Shell - Size: 68.1 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 4

cadia-lvl/samromur-mfa

Aligner with MFA for Samromur dataset

Language: Shell - Size: 5.22 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

cadia-lvl/althingi-asr

An ASR recipe and speech corpus of Icelandic parliamentary speeches

Language: Shell - Size: 14.3 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

jimmyg1997/NTUA-slp-nlp

💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA

Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 0

anandmoghan/speaker-recognition

Contains code for Speaker Recognition.

Language: Python - Size: 915 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 2

iiscleap/kaldi-in-python

Some of the kaldi functions with python wrappers.

Language: Perl - Size: 19.5 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Agrover112/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem

Goodness of Pronunciation Pipelines for OOV Removal

Language: Perl - Size: 1.61 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 3

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Language: C++ - Size: 491 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 170 - Forks: 54

Anwarvic/Arabic-Speech-Recognition

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

Language: Shell - Size: 3.24 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 27 - Forks: 10

fquirin/kaldi-adapt-lm Fork of gooofy/kaldi-adapt-lm

Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech

Language: Python - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

minhnq97/asr-commands

Scripts run on Kaldi toolkit on Kaggle's Tensorflow Speech Recognition Challenge

Language: Python - Size: 3.49 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

speechio/chinese_text_normalization

Chinese text normalization for speech processing

Language: Python - Size: 918 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 554 - Forks: 135

amirharati/kaldi-alligner

Scripts to align a given wave file to its transcription using models trained with Kaldi

Language: Shell - Size: 4.15 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 30 - Forks: 6

Agrover112/Kaldi-notes

Resources helpful for Kaldi

Size: 144 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

maggieezzat/kaldi-msa-asr

A kaldi recipe for modern standard arabic speech recognition

Language: Shell - Size: 1.51 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

maggieezzat/kaldi-egy-asr

A Kaldi-Recipe for Egyptian Arabic Speech Recognition

Language: Shell - Size: 1.55 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Hamahmi/kaldi-tut

This is a Kaldi tutorial for beginners

Language: Shell - Size: 1.68 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 6

johnidm/kaldi-asr-architeture

Kaldi ASR Architecture Proposal

Language: Python - Size: 1.3 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sidgupta234/Indian_English_ASR

An Indian English ASR system based on Hidden Markov Models (HMM), designed using Kaldi (Povey et al., 2011).

Language: Shell - Size: 79.6 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

KunalDhawan/ASR-System-for-Hindi-Language

This repository contains all the code for my project, an Automatic Speech Recognition System for the Hindi language (project description: https://kunal-dhawan.weebly.com/asr-system-for-hindi-language-from-scratch.html). It contains code for the following systems: 1) a monophone-HMM system built using the HTK toolkit, 2) a monophone-HMM system built using the Kaldi toolkit, 3) a triphone-HMM system built using the Kaldi toolkit, and 4) a DNN-HMM system built using the Kaldi toolkit.

Language: Shell - Size: 114 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 26 - Forks: 17

SpringerNLP/Chapter8

Chapter 8: Automatic Speech Recognition

Language: Jupyter Notebook - Size: 177 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 11 - Forks: 5

ZoraizQ/urdu-speech-recognition

Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs using the PRUS dataset.

Language: Shell - Size: 1.16 GB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

Chaanks/stklia

A simple version of our torch kaldi toolkit, developed at the LIA by two apprentices (@Chaanks & @vbrignatz).

Language: Python - Size: 46.6 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

purijs/kaldi-asr-aws

This code repo is in reference to the Medium Article for setting up Kaldi on AWS

Language: Python - Size: 32.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 5

charlesliucn/LanMIT Fork of kaldi-asr/kaldi

📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.

Language: C++ - Size: 139 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 20 - Forks: 0

pragyak412/Improving-Voice-Separation-by-Incorporating-End-To-End-Speech-Recognition

Implementing the paper -

Language: Python - Size: 261 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 15 - Forks: 2

3wille/bbb-kaldi-connector

Connect BigBlueButton conferences to Kaldi automatic-speech-recognition

Language: Go - Size: 206 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

SethiPawandeep/kaldi-for-dummies

This is the repository for my version of Kaldi for Dummies example.

Language: Shell - Size: 2.08 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 10

lormaechea/kaldi-grammar-compiler

A minimal tool that helps transform fixed grammars into compiled Finite State Transducers (FSTs), making them readable as language models (G.fst) in Kaldi.

Language: Ragel - Size: 47.9 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

mohammad-yazdani/nevis

Nevis is a (sort of) all in one speech transcription library backed by Kaldi ASR.

Language: Python - Size: 23.3 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

DeepHiveMind/PyTorch_TF2_DeepNN_Image_Speech_NLP_Recommendation_Transformer Fork of NVIDIA/DeepLearningExamples

🔥 Deep NN Models with FRAMEWORKS: PyTorch, Tensorflow 2, Kaldi, FastSpeech, MxNET. Image/Speech/NLP/Recommendation/Transformer Cognitive Analytics

Size: 72.1 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

igorsitdikov/lid_kaldi

Language: C++ - Size: 32.3 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 22 - Forks: 6

tjysdsg/ali_to_phone

Extract phone-level alignment and phonemic transcript from kaldi ali.*.gz files

Language: Shell - Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

tjysdsg/aidatatang_force_align

Perform force alignment on Mandarin data using aidatatang pretrained model at https://kaldi-asr.org/models/m10

Language: Shell - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

tongplw/ASR-web-based-restaurant

🍔 Foody, a smart voice-assistant web-based restaurant using Kaldi, React, and WebRTC

Language: JavaScript - Size: 65.7 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

AdityaYadavalli1/Kaldi-on-ADA

My documentation of which script does what in Kaldi

Size: 423 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 2

bagustris/id

Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019.

Language: Shell - Size: 4.83 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 5

t13m/kaldi-readers-for-tensorflow

readers that enable reading kaldi ark in tensorflow

Language: C++ - Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 7

SinSpeech-Development/SinSpeech-WebApp

Web application for speech recognition that can be configured with kaldi.

Language: TypeScript - Size: 455 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

SinSpeech-Development/sinspeech

Scripts used to run an experiment using Kaldi for Sinhala speech recognition.

Language: Shell - Size: 3.77 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

walterheymans/pytorch-kaldi-gan Fork of mravanelli/pytorch-kaldi

This is a fork of PyTorch-Kaldi, a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit. This repo adds support to use a GAN front-end for an ASR acoustic model.

Language: Python - Size: 639 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

samespace/kaldi-data-preperation

Tool to transform data from Nemo/Deepspeech format to Kaldi format, as described at https://kaldi-asr.org/doc/data_prep.html

Language: Python - Size: 1.02 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

junaedifahmi/FlaskKaldi

Kaldi Implementation with Flask

Language: Python - Size: 5.2 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

junaedifahmi/kaldi

Playing with Kaldi, trying to make my own recipes.

Language: Shell - Size: 50.6 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

PadmajaVB/listen-attend-spell

Code for converting speech data into text using encoder-decoder model.

Language: Python - Size: 10.6 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 8 - Forks: 6

alex-ht/options-segmenter

scripts to build a keyword-filler based recognizer for four-option single choice question speech segmentation.

Language: Shell - Size: 72.3 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

begemotv2718/recipes Fork of freerussianasr/recipes

Language: Shell - Size: 132 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

Related Keywords
kaldi-asr (58), speech-recognition (29), kaldi (28), asr (19), automatic-speech-recognition (8), speech-to-text (8), python (7), speech (5), deep-learning (4), pytorch (4), deep-neural-networks (3), speaker-recognition (3), arabic (3), tensorflow (3), samromur (2), speaker-verification (2), nlp (2), nnet3 (2), kaldi-server (2), text-normalization (2), force-alignment (2), grammars (2), speaker-identification (2), chinese (2), openfst (2), pykaldi (2), automata (2), whisper (2), asr-model (2), indian-english-speech-data (1), kaldi-toolkit (1), gmm (1), hmm (1), sphinx (1), tutorial (1), multi-speaker (1), prus (1), urdu (1), representation-learning (1), resnet (1), celery (1), arabic-numbers (1), arabic-numerals (1), cmu-sphinx (1), cmusphinx (1), g2p (1), jsgf-grammars (1), kenlm (1), language-model (1), ngram-models (1), zamia (1), sparrowhawk (1), thrax-gramma (1), alignment (1), forced-alignment (1), hacktoberfest (1), kaldi-librispeech (1), egyptian (1), kaldi-helpers (1), natural-language-processing (1), language-recognition (1), spoken-language-identification (1), spoken-language-recognition (1), phonemic-transcription (1), mandarin (1), react (1), restaurant (1), rtc (1), voice-assistant (1), webrtc (1), documentation (1), bahasa-indonesia (1), kaldi-decoder (1), sinhala-asr (1), generative-adversarial-network (1), multi-style-training (1), flask (1), voxforge (1), blstm (1), encoder-decoder (1), russian-support (1), speaker-embedding (1), aws (1), keyword-spotting (1), language-modeling (1), low-resource-languages (1), espnet (1), voice-separation (1), bigbluebutton (1), fixed-grammars (1), language-models (1), transcription (1), bert (1), fastspeech (1), image-analytics (1), mxnet (1), pytorch-implementation (1), recommendation (1), speech-synthesisi (1), tesnorflow (1)