An open API service providing repository metadata for many open source software ecosystems.

Topic: "wav2vec"

s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python - Size: 135 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2,253 - Forks: 484

mailong25/self-supervised-speech-recognition

Speech to text with self-supervised learning based on the wav2vec 2.0 framework

Language: Python - Size: 13.1 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 359 - Forks: 113

oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Language: Python - Size: 2.84 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 348 - Forks: 56

arxyzan/data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 146 - Forks: 25

shangeth/SpeakerProfiling

Estimating the Age, Height, and Gender of a speaker with their speech signal.

Language: Python - Size: 195 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 20

lucasgris/wav2vec4bp

Wav2vec resources and models for Brazilian Portuguese

Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 2

loretoparisi/wave2vec-recognize-docker

Wav2vec 2.0 Recognize pipeline

Language: Python - Size: 33.2 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 33 - Forks: 10

robinhad/voice-recognition-ua

Training scripts for Speech-To-Text models for the Ukrainian language

Language: Jupyter Notebook - Size: 403 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 2

bhattbhavesh91/wav2vec2-huggingface-demo

Speech to Text with self-supervised learning based on the wav2vec 2.0 framework using Hugging Face's Transformers

Language: Jupyter Notebook - Size: 841 KB - Last synced at: 3 days ago - Pushed at: almost 4 years ago - Stars: 30 - Forks: 14

daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Language: Python - Size: 88.9 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 24 - Forks: 3

notAI-tech/IndicASR

Speech Recognition for Indic languages.

Language: Python - Size: 623 KB - Last synced at: 16 days ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

jvel07/wav2vec2_patho

Fine-tuning wav2vec2 for Pathological Speech Processing

Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

slinusc/speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

Language: Jupyter Notebook - Size: 8.56 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.

Language: Shell - Size: 4.53 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

manhph2211/Speech-Processing

Building a speaker identification & verification pipeline for Vietnamese voices

Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

phanxuanphucnd/wav2asr

A library version of the wav2vec 2.0 framework for the Automatic Speech Recognition task.

Language: Python - Size: 8.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 4

Katashynskyi/Voice_assistant_UA_EN

No api-keys | local | llama3.1 For language studying and live translation

Language: Python - Size: 1.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

thisisHJLee/Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean

Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

mead-ml/audio8

Deep audio modeling

Language: Python - Size: 307 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

MarwaAbdelAal/ASR-correction-model

An ASR model generates a transcription from audio waves, then corrects the word spelling

Language: Python - Size: 12.7 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

NabinAdhikari674/wav2vec

A repo to make installation and training of a wav2vec model easier

Language: Python - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

ogunlao/speech_language_models

A collection of speech language models with a focus on acoustic codes

Language: Python - Size: 133 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

abdur75648/DINet-Inference

Create high-resolution visually dubbed videos with DINet

Language: Python - Size: 45.9 KB - Last synced at: 21 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

hciays/ailab_ss2022

ASR for the German language

Language: Python - Size: 39.1 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Natalia-T/NeurIPS2021

Fork of LeBenchmark/NeurIPS2021

Size: 13.4 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0