Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speaker-embedding

juanmc2005/diart

A python package to build AI-powered real-time audio applications

Language: Python - Size: 30 MB - Last synced: 11 days ago - Pushed: 5 months ago - Stars: 830 - Forks: 71

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook - Size: 250 MB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 5,154 - Forks: 702

zabir-nabil/awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.

Size: 21.5 KB - Last synced: 12 days ago - Pushed: almost 3 years ago - Stars: 11 - Forks: 2

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

Language: Python - Size: 36.9 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 15 - Forks: 1

Chris10M/Lip2Speech

A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.

Language: Python - Size: 12.4 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 67 - Forks: 19

ranchlai/awesome-speaker-embedding

A curated list of speaker-embedding speaker-verification, speaker-identification resources.

Size: 334 KB - Last synced: 10 days ago - Pushed: almost 3 years ago - Stars: 42 - Forks: 5

maxhollmann/voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

Language: Python - Size: 80.1 KB - Last synced: 17 days ago - Pushed: about 3 years ago - Stars: 41 - Forks: 4

warisqr007/vq-ppg-vc

Vector Quantized PPGs based Voice conversion

Language: Jupyter Notebook - Size: 1.36 MB - Last synced: 23 days ago - Pushed: 9 months ago - Stars: 5 - Forks: 1

SEERNET/Voice-Prints

Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.

Size: 6.84 KB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 13 - Forks: 3

MaxMax2016/VI-Speaker

Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.

Language: Python - Size: 62.5 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 23 - Forks: 3

yistLin/dvector

Speaker embedding (d-vector) trained with GE2E loss

Language: Python - Size: 85.9 KB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 244 - Forks: 44

juanmc2005/SpeakerEmbeddingLossComparison

Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020

Language: Jupyter Notebook - Size: 16.1 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 58 - Forks: 9

PiotrTa/Huawei-Challenge-Speaker-Identification

Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.

Language: Jupyter Notebook - Size: 33.3 MB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 36 - Forks: 10

Walleclipse/Deep_Speaker-speaker_recognition_system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

Language: Python - Size: 429 MB - Last synced: 7 months ago - Pushed: about 4 years ago - Stars: 235 - Forks: 79

DongyaoZhu/Real-Time-Accent-Conversion

Real Time Foreign Accent Conversion

Language: Python - Size: 1.03 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 46 - Forks: 8

xx205/voxsrc2020_speaker_verification

This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.

Language: Python - Size: 280 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 5 - Forks: 0

yuyq96/D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Language: Python - Size: 155 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 70 - Forks: 23

swshon/voxceleb-ivector

Voxceleb1 i-vector based speaker recognition system

Language: Perl - Size: 2.16 MB - Last synced: 3 months ago - Pushed: about 6 years ago - Stars: 41 - Forks: 11

Chaanks/stklia

simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)

Language: Python - Size: 46.6 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 7 - Forks: 1

bghorvath/fastClusteringDiarizer

Fast clustering of speaker embeddings for multifile speaker diarization with reappearing speakers

Language: Jupyter Notebook - Size: 402 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

cvqluu/dropclass_speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Language: Python - Size: 178 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 21 - Forks: 13

ZhaZhaFon/repo_voxcelebtrainer Fork of clovaai/voxceleb_trainer

说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector

Language: Python - Size: 175 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

deep-privacy/sidekit

For further release go to: https://git-lium.univ-lemans.fr/speaker/sidekit

Language: Python - Size: 1.19 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

arhtur007/Angular-Triplet-Center-Loss

Angular triplet center loss implementation in Pytorch.

Language: Python - Size: 2.93 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 8 - Forks: 3

iPRoBe-lab/1D-Triplet-CNN

PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.

Language: Python - Size: 4.43 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 23 - Forks: 7

ZhaZhaFon/repo_dvector Fork of yistLin/dvector

说话人识别仓库-说话人表征-dvector || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector

Language: Python - Size: 3.57 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

hujinsen/PyTorch_Speaker_Verification Fork of HarryVolek/PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python - Size: 43 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 0

Related Keywords
speaker-embedding 27 speaker-verification 19 speaker-recognition 15 speaker-identification 9 pytorch 6 deep-learning 6 speaker-diarization 4 voxceleb 4 voice-activity-detection 3 kaldi 3 metric-learning 3 ge2e 2 dvector 2 representation-learning 2 speaker-adaptation 2 acoustic-model 2 speech 2 xvector 2 voxceleb1 2 machine-learning 2 speech-processing 2 real-time 2 voice-cloning 1 vq-vae 1 vqvae 1 voice-authentication 1 i-vector 1 time-delay-neural-network 1 temporal-convolutional-network 1 dpn 1 d-tdnn 1 res2net 1 voxsrc 1 voxceleb2 1 tdnn 1 tensorflow 1 speaker-representation 1 convolutional-neural-networks 1 speaker-embeddings 1 loss-functions 1 loss-function 1 face-verification 1 face-recognition 1 automatic-speaker-verification 1 asv 1 speaker-representatoin 1 rvector 1 metalearning 1 meta-learning 1 dropout 1 knn-classification 1 clustering 1 agglomerative-clustering 1 resnet 1 kaldi-asr 1 voice-conversion 1 transformer 1 prosody-transfer 1 prosody 1 phonetic-posteriorgram 1 luigi 1 speech-synthesis 1 liptospeech 1 lipreading 1 lip-reading 1 speaker 1 awesome-list 1 speech-activity-detection 1 speaker-change-detection 1 pretrained-models 1 overlapped-speech-detection 1 transcription 1 streaming-audio 1 vocoder 1 speech-recognition 1 multiband-melgan 1 melgan 1 generalised-end-to-end 1 foreign-accent-conversion 1 domain-transfer 1 accent 1 triplet-loss 1 keras 1 voice-recognition 1 x-vector 1 sincnet 1 end-to-end-machine-learning 1 additive-angular-margin-loss 1 torchscript 1 speaker-encoder 1 voice-clone 1 vits 1