GitHub topics: librispeech
juliagusak/dataloaders
Pytorch and TensorFlow data loaders for several audio datasets
Language: Python - Size: 115 KB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 111 - Forks: 11

wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
Language: Python - Size: 9.2 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 44 - Forks: 14

LuluW8071/Deep-Speech-2
Implementation of Deep Speech 2 paper with BiGRU and BiLSTM using LibriSpeech Dataset
Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

HarishGoudVennakula/SELF-SUPERVISED-REPRESENTATION-LEARNING
A useful librispeech project where without using the datastes available in the internet. Here you have to create your own audio files and take them as input to create text as output. There is problem with the dataset available in online this project comes in handy for people who interested in this.
Language: Python - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language: HTML - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 364 - Forks: 29

hirofumi0810/tensorflow_end2end_speech_recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Language: Python - Size: 4.17 MB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 313 - Forks: 120

filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
Language: Python - Size: 238 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 355 - Forks: 69

soheil-mp/Speech-Recognition
End-to-End Speech Recognition using Neural Networks.
Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 35 - Forks: 21

BenAAndrew/speech-transcriber
A web-app/library for transcribing speech
Language: Python - Size: 796 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

oleges1/quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
Language: Jupyter Notebook - Size: 116 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 26 - Forks: 7

UDASE-CHiME2023/reverberant-LibriCHiME-5
Scripts to generate the reverberant LibriCHiME-5 dataset.
Language: Python - Size: 341 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

zssloth/TF-Speech-Recognition
Speech Recognition Using Tensorflow
Language: Python - Size: 719 KB - Last synced at: 12 months ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 3

Ephrem-ETH/E2E-ASR-on-Librispeech
End to End Automatic Speech Recognition on Librispeech: Pytorch implementation
Language: Python - Size: 1.17 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

andi611/Kaldi-LibriSpeech-fMLLR
This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features
Language: Shell - Size: 7.81 KB - Last synced at: 27 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

stefanpantic/asr
Automatic speech recognition using neural networks
Language: Python - Size: 143 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 1

jayaneetha/GenderClassifierLibriSpeech
Gender Classification of the speaker from LibriSpeech Dataset
Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

jreremy/conformer
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 3

hammaad2002/SimpleASRmodel
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vjoki/fsl-experi
Few-shot learning experiments mostly on speaker recognition.
Language: Python - Size: 3.58 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

hirofumi0810/asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
Language: Python - Size: 1.67 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 66 - Forks: 22

to-schi/ASR-Deepspeech2-Tensorflow
An end-to-end speech recognition engine similar to DeepSpeech2
Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

30stomercury/Automatic-Speech-Recognition
End-to-End Speech Recognition Using Tensorflow
Language: Python - Size: 1.93 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 8

pyyush/SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Language: Python - Size: 3.02 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 37 - Forks: 8

bhigy/zr-2021vg_baseline
Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition
Language: Python - Size: 371 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

EmanuelAlogna/Gender-Classification-using-ML
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
Language: Jupyter Notebook - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 3

tnakatani/dnn_speech_recognition
Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline
Language: HTML - Size: 18.6 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1
