Topic: "speaker-embedding"
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook - Size: 252 MB - Last synced at: about 6 hours ago - Pushed at: 8 days ago - Stars: 7,327 - Forks: 868

juanmc2005/diart
A python package to build AI-powered real-time audio applications
Language: Python - Size: 34.8 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 1,255 - Forks: 97

yistLin/dvector
Speaker embedding (d-vector) trained with GE2E loss
Language: Python - Size: 85.9 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 244 - Forks: 44

Walleclipse/Deep_Speaker-speaker_recognition_system
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Language: Python - Size: 429 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 235 - Forks: 79

Chris10M/Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
Language: Python - Size: 12.4 MB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 83 - Forks: 20

yuyq96/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Language: Python - Size: 155 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 70 - Forks: 23

juanmc2005/SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 20 days ago - Pushed at: over 4 years ago - Stars: 59 - Forks: 8

ranchlai/awesome-speaker-embedding
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
Size: 334 KB - Last synced at: about 12 hours ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 5

DongyaoZhu/Real-Time-Accent-Conversion
Real Time Foreign Accent Conversion
Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 46 - Forks: 8

maxhollmann/voxceleb-luigi
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
Language: Python - Size: 80.1 KB - Last synced at: 21 days ago - Pushed at: about 4 years ago - Stars: 43 - Forks: 4

swshon/voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
Language: Perl - Size: 2.16 MB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 40 - Forks: 11

PiotrTa/Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 36 - Forks: 10

Picovoice/eagle
On-device speaker recognition engine powered by deep learning
Language: Python - Size: 36.3 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 5

PlayVoice/VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
Language: Python - Size: 62.5 KB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 3

iPRoBe-lab/1D-Triplet-CNN
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.
Language: Python - Size: 4.43 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 7

cvqluu/dropclass_speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Language: Python - Size: 178 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 13

zabir-nabil/awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
Size: 21.5 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

SEERNET/Voice-Prints
Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.
Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 13 - Forks: 3

bunyaminergen/awesome-speech-dataset
Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quality speech data covering various domains such as conversational, academic, political, and more.
Size: 113 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

arhtur007/Angular-Triplet-Center-Loss
Angular triplet center loss implementation in Pytorch.
Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 3

xx205/voxsrc2020_speaker_verification
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.
Language: Python - Size: 308 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

Chaanks/stklia
simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)
Language: Python - Size: 46.6 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

warisqr007/vq-ppg-vc
Vector Quantized PPGs based Voice conversion
Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

deep-privacy/sidekit
For further release go to: https://git-lium.univ-lemans.fr/speaker/sidekit
Language: Python - Size: 1.19 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

ZhaZhaFon/repo_voxcelebtrainer Fork of clovaai/voxceleb_trainer
说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
Language: Python - Size: 175 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

hujinsen/PyTorch_Speaker_Verification Fork of HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Language: Python - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

z3lx/speaker-identification
Speaker identification on audio files using the pyannote/embedding model.
Language: Python - Size: 14.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

bghorvath/fastClusteringDiarizer
Fast clustering of speaker embeddings for multifile speaker diarization with reappearing speakers
Language: Jupyter Notebook - Size: 402 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ZhaZhaFon/repo_dvector Fork of yistLin/dvector
说话人识别仓库-说话人表征-dvector || a ready-to-use repo for Speaker Verification / Speaker Embedding with dvector
Language: Python - Size: 3.57 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0
