An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: librispeech

juliagusak/dataloaders

PyTorch and TensorFlow data loaders for several audio datasets

Language: Python - Size: 115 KB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 111 - Forks: 11

wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Language: Python - Size: 9.2 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 44 - Forks: 14

LuluW8071/Deep-Speech-2

Implementation of the Deep Speech 2 paper with BiGRU and BiLSTM using the LibriSpeech dataset

Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

HarishGoudVennakula/SELF-SUPERVISED-REPRESENTATION-LEARNING

A useful LibriSpeech project that works without the datasets available on the internet. You create your own audio files and use them as input to produce text as output. If you have problems with the datasets available online, this project comes in handy.

Language: Python - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language: HTML - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 364 - Forks: 29

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)

Language: Python - Size: 4.17 MB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 313 - Forks: 120

filippogiruzzi/voice_activity_detection

Voice Activity Detection based on Deep Learning & TensorFlow

Language: Python - Size: 238 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 355 - Forks: 69

soheil-mp/Speech-Recognition

End-to-End Speech Recognition using Neural Networks.

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 35 - Forks: 21

BenAAndrew/speech-transcriber

A web-app/library for transcribing speech

Language: Python - Size: 796 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

oleges1/quartznet-pytorch

QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]

Language: Jupyter Notebook - Size: 116 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 26 - Forks: 7

UDASE-CHiME2023/reverberant-LibriCHiME-5

Scripts to generate the reverberant LibriCHiME-5 dataset.

Language: Python - Size: 341 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

zssloth/TF-Speech-Recognition

Speech Recognition Using TensorFlow

Language: Python - Size: 719 KB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 3

Ephrem-ETH/E2E-ASR-on-Librispeech

End-to-End Automatic Speech Recognition on LibriSpeech: PyTorch implementation

Language: Python - Size: 1.17 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

andi611/Kaldi-LibriSpeech-fMLLR

This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features

Language: Shell - Size: 7.81 KB - Last synced at: 27 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

stefanpantic/asr

Automatic speech recognition using neural networks

Language: Python - Size: 143 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 1

jayaneetha/GenderClassifierLibriSpeech

Gender Classification of the speaker from LibriSpeech Dataset

Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

jreremy/conformer

PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

hammaad2002/SimpleASRmodel

A simple CRDNN-based ASR model for my own understanding of how ASR models work and are trained. (Work in progress.) If anyone finds any errors or has any suggestions, please let me know.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vjoki/fsl-experi

Few-shot learning experiments mostly on speaker recognition.

Language: Python - Size: 3.58 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

hirofumi0810/asr_preprocessing

Python implementation of pre-processing for End-to-End speech recognition

Language: Python - Size: 1.67 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 66 - Forks: 22

to-schi/ASR-Deepspeech2-Tensorflow

An end-to-end speech recognition engine similar to DeepSpeech2

Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

30stomercury/Automatic-Speech-Recognition

End-to-End Speech Recognition Using TensorFlow

Language: Python - Size: 1.93 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 8

pyyush/SpecAugment

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Language: Python - Size: 3.02 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 37 - Forks: 8

bhigy/zr-2021vg_baseline

Baselines for the Zero Resource Speech Challenge using Visually-Grounded Models of Spoken Language, 2021 edition

Language: Python - Size: 371 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

EmanuelAlogna/Gender-Classification-using-ML

Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.

Language: Jupyter Notebook - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 3

tnakatani/dnn_speech_recognition

Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline

Language: HTML - Size: 18.6 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Related Keywords
librispeech (26), speech-recognition (11), asr (10), automatic-speech-recognition (7), pytorch (6), tensorflow (6), deep-learning (5), timit (4), machine-learning (4), librispeech-dataset (4), ctc (3), speaker-recognition (3), timit-dataset (3), speech-to-text (3), attention-mechanism (3), neural-networks (3), speaker-identification (3), speech-processing (2), speech (2), csj (2), end-to-end (2), deep-neural-networks (2), python (2), transcription (2), asr-model (2), common-voice (2), quartznet (2), deeplearning (2), lstm (2), neural-network (2), audio-processing (2), ctc-decode (2), dataset (2), representation-learning (1), switchboard (1), preprocessing (1), voxceleb (1), siamese-neural-network (1), resnet34 (1), metric-learning (1), few-shot-learning (1), supervised-learning (1), speech-classification (1), svm (1), pytorch-tutorial (1), pytorch-implementation (1), crdnn (1), conformer (1), keras (1), cnn (1), classification (1), jasper (1), librispeech-fmllr (1), kaldi-librispeech (1), multimodal-learning (1), challenge (1), spectrogram (1), spokencoco (1), specaugment (1), masking (1), visually-grounded-speech (1), data-augmentation (1), weakly-supervised-learning (1), convolutional-neural-networks (1), tfrecord (1), k-nearest-neighbors (1), logistic-regression (1), location-aware-attention (1), listen-attend-and-spell (1), machine-learning-algorithms (1), mel-spectrogram (1), mlp (1), naive-bayes (1), data-preparation (1), perceptron (1), ctc-loss (1), end-to-end-learning (1), beam-search (1), speechrecognition (1), speech-separation (1), speech-recognizer (1), speech-emotion-recognition (1), speech-api (1), speech-analysis (1), speaker-verification (1), beamforming (1), kenlm-toolkit (1), hacktoberfest (1), deep-speech (1), transformer-models (1), transformer (1), speaker-recognition-systems (1), tfrecords (1), nsynth (1), gtzan (1), esc (1), dataloader (1), kaldi (1), fmllr (1), e2e-asr (1)