An open API service providing repository metadata for many open source software ecosystems.

gitlab.com topics: automatic speech recognition (ASR)

prebens-phd-adventures/universal-edit-distance

A small Python library containing some generic metrics implemented in Rust

Last synced at: 3 months ago - Stars: 0 - Forks: 0

alphaspeech/alphaspeech-npm-asr-kit

This project provides a client package and example scripts for TypeScript to access the alphaspeech pro ASR stream API.

Last synced at: 4 months ago - Stars: 0 - Forks: 0

alphaspeech/alphaspeech-python

This project provides a client package and example scripts to access the alphaspeech pro ASR APIs.

Last synced at: 6 months ago - Stars: 0 - Forks: 0

erik.projs/asr/en/hub5

Hub5: small ASR data set of CTS used mainly as a test set (LDC2002T43).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

breslerek/lem_srd

LEM speech recognition device, designed for Signal Processing lecture.

Last synced at: over 2 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/wtimit

WTIMIT: TIMIT played through wideband mobile telephone networks (LDC2010S02).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/cv1

CV1: Common Voice Single-Word Target dataset from Mozilla.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/audiomnist

AudioMNIST: free dataset of spoken digits (0-9) from 60 speakers.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/aesl

AESL: American English Spoken Lexicon from 1 female speaker (LDC99L23).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/swc

SWC: Spoken Wikipedia Corpus, crowdsourced speech of read Wikipedia articles.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tedlium2

TED-LIUM 2: Release 2 of TED talk corpus from LIUM (SLR19).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fred

FRED: Freiburg English Dialect Corpus Sampler (FRED-S) of British English interviews.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/rm

RM: Resource Management v. 2.0 corpus from DARPA in the 1990s (LDC93S3A).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/noisytimit

NoisyTIMIT: TIMIT corpus with various additive noises (LDC2017S04).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/crm

CRM (coordinate response number) corpus [Bolia et al. 2000].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/grid

GRID: audiovisual corpus of grid-related commands, from Univ. Sheffield.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/rt03

rt03: NIST 2003 Rich Transcription Evaluation Data for CTS (LDC2007S10).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/reddots

RedDots: corpus of short-dur utts from mobile apps (sites.google.com/site/thereddotsproject).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/pda

PDA: Personal Digital Assistant speech dataset from CMU.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ng_en

ng_en: Nigerian English high-quality crowdsourced dataset (SLR70).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/lj

LJ: LJ (Linda Johson) Speech Corpus (v. 1.1), often used for TTS.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/heysnips2

heysnips2: Hey Snips Dataset 2 for KWS from Sonos [Leroy et al. 2019].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

tiro-is/heyra

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

xdroid-public/xdroid-amazon-connect-integration

This repository provides resources for a Quick Start guide for connecting Amazon Connect with Xdroid platform to provide post-call analytics. Intended target audience are system administrators who manage and configure the AWS Amazon Connect instance, and also for system architects and support engineers.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

KPHIBYE/voskwrapper

C# library that provides an easy to use abstraction of the Vosk speech recognition toolkit

Last synced at: almost 3 years ago - Stars: 1 - Forks: 1

fb-resources/kaldi-br

Models trained with Kaldi

Last synced at: almost 3 years ago - Stars: 3 - Forks: 0

erik.projs/asr/en/timit

TIMIT: famous corpus of American English with phone-level transcriptions (LDC93S1).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/librispeech

LibriSpeech: large ASR data set of read books (SLR12) [Panayotov et al. 2015].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/an4

AN4: Alphanumeric or "census" database from CMU [Acero 1993].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/buckeye

Buckeye: Buckeye Speech Corpus (release 2) of interviews from Ohio State.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/pearson/poc

Proof-of-concept (POC) app towards Aida English app.

Last synced at: almost 3 years ago - Stars: 1 - Forks: 0

erik.projs/asr/en/indictts

IndicTTS: Indian English speech from the IIT TTS Team.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/emime

EMIME Bilingual {Finnish,German,Mandarin}/English database (www.emime.org).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 1

erik.projs/asr/en/ucam

UCAM Bilingual database from EMIME (www.emime.org).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/lombard

Lombard Grid: extension of Grid corpus with Lombard and normal speech.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ctimit

CTIMIT: TIMIT played through cellphone network (LDC96S30).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/noisy-vctk

Noisy-VCTK: Noisy subset of VCTK (Voice Cloning Toolkit) dataset from CSTR.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/libritts

LibriTTS: Librispeech for text-to-speech (TTS) corpus (SLR60).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/arctic

CMU_ARCTIC dataset from CMU FestVox project (www.festvox.org/cmu_arctic).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/cmu_sin

CMU_SIN (speech-in-noise) dataset of Lombard speech from CMU FestVox project.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/commonvoice

commonvoice: Common Voice dataset of crowdsourced speech from Mozilla.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/dr-vctk

DR-VCTK: device-recorded Voice Cloning Toolkit (DR-VCTK) dataset from CSTR.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/vctk

VCTK: Voice Cloning Toolkit dataset from CSTR, Edinburgh [Veaux et al. 2013].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/voxforge

VoxForge: free, open-source ASR dataset of crowdsourced speech from voxforge.org.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/wsj

WSJ: Wall Street Journal corpus from ARPA in 1992, 1994 (LDC93S6A, LDC94S13A).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tatoeba

Tatoeba: Tatoeba Project of English sentences (https://tatoeba.org/eng).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fluentcmd

fluentcmd: Fluent Speech Commands Dataset for SLU from fluent.ai.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/heysnips1

heysnips1: Hey Snips Dataset 1 for KWS from Sonos [Coucke et al. 2019]

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/swbd1

Switchboard-1, release 2: famous ASR data set of CTS from the 1990s (LDC97S62).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tedlium1

TED-LIUM 1: Release 1 of TED talk corpus from LIUM (SLR7).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fisher

Fisher: large ASR dataset of CTS (LDC2004S13, LDC2005S13, LDC2004T19, LDC2005T19).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/marsec2

marsec2: Aix-MARSEC v. 2 database of carefully-annotated British English.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/vystadial

Vystadial: English part of Vystadial CTS corpus from Prague [Korvas et al. 2014].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/tedlium3

TED-LIUM 3: Release 3 of TED talk corpus from LIUM (SLR51).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/nie_csse

NIE_CSSE: NIE Corpus of Spoken Singapore English.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ami

AMI (Augmented Multi-party Interaction) Meeting Corpus (SLR16).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/leap

LeaP: LeaP (Learning Prosody in a Foreign Language) corpus of English learners.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/speechcmd

speechcmd: Speech Commands Dataset from Google [Warden 2018].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/snips

Snips: SLU dataset from Snips (now part of Sonos) [Saade et al. 2019].

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/ntimit

NTIMIT: TIMIT played through NYNEX telephone network (LDC93S2).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/fsdd

FSDD: Fee Spoken Digit Dataset (FSDD) at 8 kHz from Jakobovski Github page.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/heysnips0

heysnips0: Hey Snips initial dataset for KWS from Sonos.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/aesop

Aesop: Aesop British English Corpus of read speech from the Oxford Phonetics Lab.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/st-aeds

ST-AEDS: Surfingtech American English Dataset of cellphone speech (SLR45).

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/asr/en/synthcmd

synthcmd: Synthetic Speech Commands Dataset from Kaggle.

Last synced at: almost 3 years ago - Stars: 0 - Forks: 0

erik.projs/pearson/aida1

Aida English app, dataset 1 (alpha).

Last synced at: almost 3 years ago - Stars: 1 - Forks: 0