Topic: "wav2vec"
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python - Size: 135 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,253 - Forks: 484

mailong25/self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Language: Python - Size: 13.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 359 - Forks: 113

oliverguhr/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Language: Python - Size: 2.84 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 348 - Forks: 56

arxyzan/data2vec-pytorch
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 146 - Forks: 25

shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal.
Language: Python - Size: 195 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 20

lucasgris/wav2vec4bp
Wav2vec resources and models for Brazilian Portuguese
Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 2

loretoparisi/wave2vec-recognize-docker
Wav2vec 2.0 Recognize pipeline
Language: Python - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 33 - Forks: 10

robinhad/voice-recognition-ua
Training scripts for Speech-To-Text models for Ukrainian language
Language: Jupyter Notebook - Size: 403 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 2

bhattbhavesh91/wav2vec2-huggingface-demo
Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face's Transformers
Language: Jupyter Notebook - Size: 841 KB - Last synced at: 3 days ago - Pushed at: almost 4 years ago - Stars: 30 - Forks: 14

daanzu/wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Language: Python - Size: 88.9 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 24 - Forks: 3

notAI-tech/IndicASR
Speech Recognition for Indic languages.
Language: Python - Size: 623 KB - Last synced at: 16 days ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

jvel07/wav2vec2_patho
Fine-tuning wav2vec2 for Pathological Speech Processing
Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Language: Jupyter Notebook - Size: 8.56 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
Language: Shell - Size: 4.53 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

manhph2211/Speech-Processing
Building a speaker identification & verification pipeline for Vietnamese voices :sleepy:
Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

phanxuanphucnd/wav2asr
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
Language: Python - Size: 8.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 4

Katashynskyi/Voice_assistant_UA_EN
No api-keys | local | llama3.1 For language studying and live translation
Language: Python - Size: 1.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

thisisHJLee/Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean
Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

mead-ml/audio8
Deep audio modeling
Language: Python - Size: 307 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

MarwaAbdelAal/ASR-correction-model
ASR model that generates transcriptions from audio waves, then corrects the word spelling
Language: Python - Size: 12.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

NabinAdhikari674/wav2vec
A repo to make installation and training of a wav2vec model easier
Language: Python - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

ogunlao/speech_language_models
A collection of speech language models with a focus on acoustic codes
Language: Python - Size: 133 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

abdur75648/DINet-Inference
Create high-resolution visually dubbed videos with DINet
Language: Python - Size: 45.9 KB - Last synced at: 21 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

hciays/ailab_ss2022
ASR for the German language
Language: Python - Size: 39.1 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Natalia-T/NeurIPS2021
Fork of LeBenchmark/NeurIPS2021
Size: 13.4 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
