gitlab.com topics: automatic speech recognition (ASR)

This repository provides resources for a Quick Start guide on connecting Amazon Connect with the Xdroid platform to provide post-call analytics. The intended audience is system administrators who manage and configure the AWS Amazon Connect instance, as well as system architects and support engineers.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

KPHIBYE/voskwrapper

C# library that provides an easy to use abstraction of the Vosk speech recognition toolkit

Last synced at: almost 3 years ago - Stars: 1 - Forks: 1

fb-resources/kaldi-br

Models trained with Kaldi

Last synced at: almost 3 years ago - Stars: 3 - Forks: 0

erik.projs/asr/en/timit

TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/librispeech

LibriSpeech: large ASR data set of read books (SLR12) [Panayotov et al. 2015].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/an4

AN4: Alphanumeric or "census" database from CMU [Acero 1993].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/buckeye

Buckeye: Buckeye Speech Corpus (release 2) of interviews from Ohio State.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/pearson/poc

Proof-of-concept (POC) app for the Aida English app.

Last synced at: almost 3 years ago - Stars: 1 - Forks: 0

erik.projs/asr/en/indictts

IndicTTS: Indian English speech from the IIT TTS Team.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/emime

EMIME Bilingual {Finnish,German,Mandarin}/English database (www.emime.org).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/ucam

UCAM Bilingual database from EMIME (www.emime.org).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/lombard

Lombard Grid: extension of Grid corpus with Lombard and normal speech.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ctimit

CTIMIT: TIMIT played through cellphone network (LDC96S30).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/noisy-vctk

Noisy-VCTK: Noisy subset of VCTK (Voice Cloning Toolkit) dataset from CSTR.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/libritts

LibriTTS: corpus derived from LibriSpeech for text-to-speech (TTS) (SLR60).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/arctic

CMU_ARCTIC dataset from CMU FestVox project (www.festvox.org/cmu_arctic).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/cmu_sin

CMU_SIN (speech-in-noise) dataset of Lombard speech from CMU FestVox project.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/commonvoice

commonvoice: Common Voice dataset of crowdsourced speech from Mozilla.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/dr-vctk

DR-VCTK: Device-Recorded Voice Cloning Toolkit dataset from CSTR.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/vctk

VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/voxforge

VoxForge: free, open-source ASR dataset of crowdsourced speech from voxforge.org.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/wsj

WSJ: Wall Street Journal corpus from ARPA in 1992, 1994 (LDC93S6A, LDC94S13A).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tatoeba

Tatoeba: Tatoeba Project of English sentences (https://tatoeba.org/eng).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fluentcmd

fluentcmd: Fluent Speech Commands Dataset for SLU from fluent.ai.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/heysnips1

heysnips1: Hey Snips Dataset 1 for KWS from Sonos [Coucke et al. 2019].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/swbd1

Switchboard-1, release 2: famous ASR data set of CTS from the 1990s (LDC97S62).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tedlium1

TED-LIUM 1: Release 1 of TED talk corpus from LIUM (SLR7).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fisher

Fisher: large ASR dataset of CTS (LDC2004S13, LDC2005S13, LDC2004T19, LDC2005T19).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/marsec2

marsec2: Aix-MARSEC v. 2 database of carefully-annotated British English.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/vystadial

Vystadial: English part of Vystadial CTS corpus from Prague [Korvas et al. 2014].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tedlium3

TED-LIUM 3: Release 3 of TED talk corpus from LIUM (SLR51).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/nie_csse

NIE_CSSE: NIE Corpus of Spoken Singapore English.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ami

AMI (Augmented Multi-party Interaction) Meeting Corpus (SLR16).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/leap

LeaP: LeaP (Learning Prosody in a Foreign Language) corpus of English learners.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/speechcmd

speechcmd: Speech Commands Dataset from Google [Warden 2018].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/snips

Snips: SLU dataset from Snips (now part of Sonos) [Saade et al. 2019].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ntimit

NTIMIT: TIMIT played through NYNEX telephone network (LDC93S2).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fsdd

FSDD: Free Spoken Digit Dataset (FSDD) at 8 kHz from the Jakobovski GitHub page.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/heysnips0

heysnips0: Hey Snips initial dataset for KWS from Sonos.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/aesop

Aesop: Aesop British English Corpus of read speech from the Oxford Phonetics Lab.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/st-aeds

ST-AEDS: Surfingtech American English Dataset of cellphone speech (SLR45).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/synthcmd

synthcmd: Synthetic Speech Commands Dataset from Kaggle.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/pearson/aida1

Aida English app, dataset 1 (alpha).

Last synced at: almost 3 years ago - Stars: 1 - Forks: 0