Topic: "speaker-recognition"
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python - Size: 431 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 13,700 - Forks: 2,801

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python - Size: 97.8 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 9,723 - Forks: 1,476

pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook - Size: 252 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 7,327 - Forks: 868

google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language: Python - Size: 107 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 1,571 - Forks: 320

mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language: Python - Size: 78.9 MB - Last synced at: 17 days ago - Pushed at: almost 4 years ago - Stars: 1,169 - Forks: 265

clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Language: Python - Size: 103 KB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 1,092 - Forks: 279

yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Language: Python - Size: 4.95 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 951 - Forks: 137

athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
Language: C++ - Size: 9.94 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 947 - Forks: 189

astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
Language: Python - Size: 48.1 MB - Last synced at: 21 days ago - Pushed at: about 5 years ago - Stars: 782 - Forks: 272

wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language: Python - Size: 3.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 514 - Forks: 88

cvqluu/Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
Language: Python - Size: 9.35 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 487 - Forks: 93

taylorlu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Language: Python - Size: 52.6 MB - Last synced at: 11 months ago - Pushed at: almost 4 years ago - Stars: 453 - Forks: 124

TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language: Python - Size: 60.1 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 440 - Forks: 97

google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Language: Python - Size: 175 MB - Last synced at: 6 days ago - Pushed at: 25 days ago - Stars: 411 - Forks: 40

nuaazs/VAF_2
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Language: Python - Size: 32.7 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 403 - Forks: 21

speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language: HTML - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 364 - Forks: 29

yeyupiaoling/VoiceprintRecognition-Tensorflow
使用Tensorflow实现声纹识别
Language: Python - Size: 1.01 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 308 - Forks: 67

manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Language: Python - Size: 356 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 306 - Forks: 65

yeyupiaoling/VoiceprintRecognition-PaddlePaddle
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
Language: Python - Size: 5.09 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 262 - Forks: 48

SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Language: Tcl - Size: 248 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 257 - Forks: 67

crouchred/speaker-recognition-py3 📦
Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 242 - Forks: 82

Walleclipse/Deep_Speaker-speaker_recognition_system
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Language: Python - Size: 429 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 235 - Forks: 79

VITA-Group/AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Language: Python - Size: 193 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 208 - Forks: 42

Atul-Anand-Jha/Speaker-Identification-Python
Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library
Language: Python - Size: 11.8 MB - Last synced at: 25 days ago - Pushed at: almost 5 years ago - Stars: 207 - Forks: 76

NavodPeiris/speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Language: Python - Size: 33.9 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 203 - Forks: 18

jymsuper/SpeakerRecognition_tutorial
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
Language: Python - Size: 678 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 199 - Forks: 44

IBM-Cloud/chatbot-watson-android 📦
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Language: Java - Size: 3.42 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 195 - Forks: 181

cvqluu/TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Language: Python - Size: 708 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 183 - Forks: 40

oscarknagg/voicemap
Identifying people from small audio fragments
Language: Python - Size: 3.18 MB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 170 - Forks: 73

lihanghang/CASR-DEMO
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Language: CSS - Size: 97.2 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 161 - Forks: 28

cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language: Python - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 140 - Forks: 34

yeyupiaoling/VoiceprintRecognition-Keras
基于Kersa实现的声纹识别模型
Language: Python - Size: 1.57 MB - Last synced at: 18 days ago - Pushed at: 7 months ago - Stars: 137 - Forks: 28

jefflai108/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Language: Perl - Size: 9.35 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 137 - Forks: 34

Anwarvic/Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Language: Python - Size: 22.9 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 110 - Forks: 32

Speaker-Identification/You-Only-Speak-Once
Deep Learning - one shot learning for speaker recognition using Filter Banks
Language: Jupyter Notebook - Size: 13 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 92 - Forks: 30

georgygospodinov/speech_course
Deep Learning for Speech
Language: Jupyter Notebook - Size: 35.6 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 90 - Forks: 8

bjfu-ai-institute/speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
Language: Python - Size: 9.49 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 90 - Forks: 22

shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Language: Python - Size: 5.21 MB - Last synced at: 24 days ago - Pushed at: almost 4 years ago - Stars: 89 - Forks: 13

GauravWaghmare/Speaker-Identification
A program for automatic speaker identification using deep learning techniques.
Language: Python - Size: 436 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 83 - Forks: 29

Speech-Interaction-Technology-Aalto-U/itsp
Introduction to Speech Processing
Language: Jupyter Notebook - Size: 254 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 82 - Forks: 15

linhdvu14/vggvox-speaker-identification
Speaker identification with VGGVox network
Language: Python - Size: 62.5 MB - Last synced at: 11 months ago - Pushed at: over 6 years ago - Stars: 82 - Forks: 34

cvqluu/GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 74 - Forks: 13

seongmin-kye/meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
Language: Python - Size: 778 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 73 - Forks: 19

grausof/keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Language: Python - Size: 260 KB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 72 - Forks: 26

VidyasagarMSC/WatBot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Language: Java - Size: 4.82 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 72 - Forks: 53

yuyq96/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Language: Python - Size: 155 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 70 - Forks: 23

cyrta/voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Language: Shell - Size: 10 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 68 - Forks: 19

TaoRuijie/Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
Language: Python - Size: 40.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 65 - Forks: 12

zycv/OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.
Language: C++ - Size: 15.5 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 64 - Forks: 13

Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Language: Python - Size: 1.87 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 63 - Forks: 15

Wadaboa/titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 62 - Forks: 13

hyperion-ml/hyperion
Python toolkit for speech processing
Language: Python - Size: 150 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 61 - Forks: 18

mjpyeon/wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 57 - Forks: 11

Adirockzz95/Piwho
Speaker recognition library based on MARF for raspberry pi and other SBCs.
Language: Python - Size: 1.49 MB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 56 - Forks: 20

shangeth/SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal.
Language: Python - Size: 195 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 20

andi611/Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
Language: Python - Size: 1.56 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 54 - Forks: 12

SuperKogito/Voice-based-speaker-identification
:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM
Language: Python - Size: 105 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

thuiar/MIntRec
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
Language: Python - Size: 1.49 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 8

Aurora11111/speaker-recognition-pytorch
Speaker recognition ,Voiceprint recognition
Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 51 - Forks: 8

pika-online/AESRC2020
a deep accent recognition network
Language: Python - Size: 105 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 10

ranchlai/awesome-speaker-embedding
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
Size: 334 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 5

wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
Language: Python - Size: 9.2 MB - Last synced at: 13 days ago - Pushed at: 12 months ago - Stars: 44 - Forks: 14

maxhollmann/voxceleb-luigi
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
Language: Python - Size: 80.1 KB - Last synced at: 23 days ago - Pushed at: about 4 years ago - Stars: 43 - Forks: 4

Picovoice/falcon
On-device speaker diarization powered by deep learning
Language: Python - Size: 20.2 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 4

swshon/voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
Language: Perl - Size: 2.16 MB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 40 - Forks: 11

bioidiap/bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Language: Python - Size: 139 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 39 - Forks: 5

zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview
This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》
Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 39 - Forks: 5

ttop32/wav2vec2-live-japanese-translator
real time japanese speech recognition translator using wav2vec2
Language: Jupyter Notebook - Size: 926 KB - Last synced at: 20 days ago - Pushed at: almost 3 years ago - Stars: 37 - Forks: 3

PiotrTa/Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 36 - Forks: 10

SpeakerGuard/SpeakerGuard
a Pytorch library for security research on speaker recognition, released in "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition" accepted by TDSC
Language: Python - Size: 507 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 35 - Forks: 10

AdityaDutt/Audio-Classification-Using-Wavelet-Transform
Classifying audio using Wavelet transform and deep learning
Language: Python - Size: 19.8 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 6

cvqluu/nn-similarity-diarization
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Language: Python - Size: 347 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 11

manthanthakker/speakerIdentificationNeuralNetworks
⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can identify the Speaker's voice in a Multi-Speaker Environment too. Multi-layer Perceptron (MLP) Neural Network based on error back propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.
Language: MATLAB - Size: 2.94 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 34 - Forks: 20

Picovoice/eagle
On-device speaker recognition engine powered by deep learning
Language: Python - Size: 36.3 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 5

mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Language: Python - Size: 398 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 16

KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch
Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 8

ZhaZhaFon/resource_speech
语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download
Size: 61.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 30 - Forks: 6

dydtjr1128/Speaker-Recognition-using-NN
Speaker Recognition using Neural Network & Linear Regression
Language: Jupyter Notebook - Size: 46.8 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 7

swshon/multi-speakerID
Language: Python - Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 13

imranparuk/speaker-recognition-3d-cnn
Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"
Language: Python - Size: 1.06 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 12

exemplaryai/ai-engine
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Size: 5.15 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 0

doerlbh/MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Language: Cuda - Size: 998 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 25 - Forks: 5

Abhay0899193/Speaker-Recognition
Speaker Recognition System using MFCC and GMM.
Language: Python - Size: 7.98 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 24 - Forks: 12

deep-privacy/SA-toolkit
SA-toolkit: Speaker speech anonymization toolkit in python
Language: Python - Size: 95 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 1

wngh1187/RawNeXt
Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic scaling policies
Language: Python - Size: 179 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 23 - Forks: 0

iPRoBe-lab/1D-Triplet-CNN
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.
Language: Python - Size: 4.43 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 7

bsxfan/meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
Language: Matlab - Size: 15 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 16

SEC4SR/SEC4SR
Source Code for 'SECurity evaluation platform FOR Speaker Recognition' released in 'Defending against Audio Adversarial Examples on Speaker Recognition Systems'
Language: Python - Size: 152 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 15

theolepage/sslsv
Framework for training and evaluating self-supervised learning methods for speaker verification.
Language: Python - Size: 10.6 MB - Last synced at: 26 days ago - Pushed at: 2 months ago - Stars: 21 - Forks: 4

vi7/ecoute-macos Fork of SevaSk/ecoute
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.
Language: Python - Size: 128 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 21 - Forks: 3

cvqluu/dropclass_speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Language: Python - Size: 178 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 13

deepaudio/deepaudio-speaker
neural network based speaker embedder
Language: Python - Size: 130 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 5

bscharan/Automatic-speech-sequence-segmentation
The Main Aim of this project is to segment and cluster an audio sample based on speaker when number of speakers are not known before hand. Main challenge in the process of speaker recognition is separting audio based on speaker.It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker's true identity.Other challenges are due to multiple speakers present at the time instant
Language: MATLAB - Size: 28.3 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 7

theolepage/ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

TaoRuijie/Speaker-Recognition-Demo
A ResNet Speaker Recognition&Verification Demo
Language: Python - Size: 40.4 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 6

mycrazycracy/Backends-for-SRE19
This repository will illustrate the use of some different backends on NIST SRE 2019.
Language: Python - Size: 82 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 19 - Forks: 14

alifarazz/September
:microphone: An offline text-independent speaker recognition system
Language: C - Size: 1.84 MB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 4

luan78zaoha/kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
Language: Shell - Size: 695 KB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 11

nuaazs/VAF
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
Language: Python - Size: 274 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 1

zabir-nabil/awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
Size: 21.5 KB - Last synced at: about 17 hours ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2
