gitlab.com topics: automatic speech recognition (ASR)
prebens-phd-adventures/universal-edit-distance
A small Python library containing some generic metrics implemented in Rust
Last synced at: 3 months ago - Stars: 0 - Forks: 0
alphaspeech/alphaspeech-npm-asr-kit
This project provides a client package and example scripts for TypeScript to access the alphaspeech pro ASR stream API.
Last synced at: 4 months ago - Stars: 0 - Forks: 0
alphaspeech/alphaspeech-python
This project provides a client package and example scripts to access the alphaspeech pro ASR APIs.
Last synced at: 6 months ago - Stars: 0 - Forks: 0
erik.projs/asr/en/hub5
Hub5: small ASR data set of CTS used mainly as a test set (LDC2002T43).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
breslerek/lem_srd
LEM speech recognition device, designed for Signal Processing lecture.
Last synced at: over 2 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/wtimit
WTIMIT: TIMIT played through wideband mobile telephone networks (LDC2010S02).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/cv1
CV1: Common Voice Single-Word Target dataset from Mozilla.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/audiomnist
AudioMNIST: free dataset of spoken digits (0-9) from 60 speakers.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/aesl
AESL: American English Spoken Lexicon from 1 female speaker (LDC99L23).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/swc
SWC: Spoken Wikipedia Corpus, crowdsourced speech of read Wikipedia articles.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/tedlium2
TED-LIUM 2: Release 2 of TED talk corpus from LIUM (SLR19).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/fred
FRED: Freiburg English Dialect Corpus Sampler (FRED-S) of British English interviews.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/rm
RM: Resource Management v. 2.0 corpus from DARPA in the 1990s (LDC93S3A).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/noisytimit
NoisyTIMIT: TIMIT corpus with various additive noises (LDC2017S04).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/crm
CRM (Coordinate Response Measure) corpus [Bolia et al. 2000].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/grid
GRID: audiovisual corpus of grid-related commands, from Univ. Sheffield.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/rt03
rt03: NIST 2003 Rich Transcription Evaluation Data for CTS (LDC2007S10).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/reddots
RedDots: corpus of short-duration utterances from mobile apps (sites.google.com/site/thereddotsproject).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/pda
PDA: Personal Digital Assistant speech dataset from CMU.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/ng_en
ng_en: Nigerian English high-quality crowdsourced dataset (SLR70).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/lj
LJ: LJ (Linda Johnson) Speech Corpus (v. 1.1), often used for TTS.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/heysnips2
heysnips2: Hey Snips Dataset 2 for KWS from Sonos [Leroy et al. 2019].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

xdroid-public/xdroid-amazon-connect-integration
This repository provides resources for a Quick Start guide on connecting Amazon Connect with the Xdroid platform to provide post-call analytics. The intended audience is system administrators who manage and configure the AWS Amazon Connect instance, as well as system architects and support engineers.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

KPHIBYE/voskwrapper
C# library that provides an easy-to-use abstraction of the Vosk speech recognition toolkit.
Last synced at: almost 3 years ago - Stars: 1 - Forks: 1
fb-resources/kaldi-br
Models trained with Kaldi
Last synced at: almost 3 years ago - Stars: 3 - Forks: 0

erik.projs/asr/en/timit
TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/librispeech
LibriSpeech: large ASR data set of read books (SLR12) [Panayotov et al. 2015].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 1
erik.projs/asr/en/an4
AN4: Alphanumeric or "census" database from CMU [Acero 1993].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/buckeye
Buckeye: Buckeye Speech Corpus (release 2) of interviews from Ohio State.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/pearson/poc
Proof-of-concept (POC) app for the Aida English app.
Last synced at: almost 3 years ago - Stars: 1 - Forks: 0
erik.projs/asr/en/indictts
IndicTTS: Indian English speech from the IIT TTS Team.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 1
erik.projs/asr/en/emime
EMIME Bilingual {Finnish,German,Mandarin}/English database (www.emime.org).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 1
erik.projs/asr/en/ucam
UCAM Bilingual database from EMIME (www.emime.org).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/lombard
Lombard Grid: extension of Grid corpus with Lombard and normal speech.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/ctimit
CTIMIT: TIMIT played through cellphone network (LDC96S30).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/noisy-vctk
Noisy-VCTK: Noisy subset of VCTK (Voice Cloning Toolkit) dataset from CSTR.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/libritts
LibriTTS: LibriSpeech-derived corpus for text-to-speech (TTS) (SLR60).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/arctic
CMU_ARCTIC dataset from CMU FestVox project (www.festvox.org/cmu_arctic).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/cmu_sin
CMU_SIN (speech-in-noise) dataset of Lombard speech from CMU FestVox project.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/commonvoice
commonvoice: Common Voice dataset of crowdsourced speech from Mozilla.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/dr-vctk
DR-VCTK: device-recorded Voice Cloning Toolkit (DR-VCTK) dataset from CSTR.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/vctk
VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/voxforge
VoxForge: free, open-source ASR dataset of crowdsourced speech from voxforge.org.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/wsj
WSJ: Wall Street Journal corpus from ARPA in 1992, 1994 (LDC93S6A, LDC94S13A).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/tatoeba
Tatoeba: Tatoeba Project of English sentences (https://tatoeba.org/eng).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/fluentcmd
fluentcmd: Fluent Speech Commands Dataset for SLU from fluent.ai.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/heysnips1
heysnips1: Hey Snips Dataset 1 for KWS from Sonos [Coucke et al. 2019].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/swbd1
Switchboard-1, release 2: famous ASR data set of CTS from the 1990s (LDC97S62).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/tedlium1
TED-LIUM 1: Release 1 of TED talk corpus from LIUM (SLR7).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/fisher
Fisher: large ASR dataset of CTS (LDC2004S13, LDC2005S13, LDC2004T19, LDC2005T19).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/marsec2
marsec2: Aix-MARSEC v. 2 database of carefully-annotated British English.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/vystadial
Vystadial: English part of Vystadial CTS corpus from Prague [Korvas et al. 2014].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/tedlium3
TED-LIUM 3: Release 3 of TED talk corpus from LIUM (SLR51).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/nie_csse
NIE_CSSE: NIE Corpus of Spoken Singapore English.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/ami
AMI (Augmented Multi-party Interaction) Meeting Corpus (SLR16).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/leap
LeaP: LeaP (Learning Prosody in a Foreign Language) corpus of English learners.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/speechcmd
speechcmd: Speech Commands Dataset from Google [Warden 2018].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/snips
Snips: SLU dataset from Snips (now part of Sonos) [Saade et al. 2019].
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/ntimit
NTIMIT: TIMIT played through NYNEX telephone network (LDC93S2).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/fsdd
FSDD: Free Spoken Digit Dataset (FSDD) at 8 kHz from the Jakobovski GitHub page.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/heysnips0
heysnips0: Hey Snips initial dataset for KWS from Sonos.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/aesop
Aesop: Aesop British English Corpus of read speech from the Oxford Phonetics Lab.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/st-aeds
ST-AEDS: Surfingtech American English Dataset of cellphone speech (SLR45).
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/asr/en/synthcmd
synthcmd: Synthetic Speech Commands Dataset from Kaggle.
Last synced at: almost 3 years ago - Stars: 0 - Forks: 0
erik.projs/pearson/aida1
Aida English app, dataset 1 (alpha).
Last synced at: almost 3 years ago - Stars: 1 - Forks: 0