An open API service providing repository metadata for many open source software ecosystems.

Topic: "speaker-recognition"

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python - Size: 431 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 13,700 - Forks: 2,801

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 97.8 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 9,723 - Forks: 1,476

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook - Size: 252 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 7,327 - Forks: 868

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language: Python - Size: 107 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 1,571 - Forks: 320

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language: Python - Size: 78.9 MB - Last synced at: 17 days ago - Pushed at: almost 4 years ago - Stars: 1,169 - Forks: 265

clovaai/voxceleb_trainer

In defence of metric learning for speaker recognition

Language: Python - Size: 103 KB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 1,092 - Forks: 279

yeyupiaoling/VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.

Language: Python - Size: 4.95 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 951 - Forks: 137

athena-team/athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Language: C++ - Size: 9.94 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 947 - Forks: 189

astorfi/3D-convolutional-speaker-recognition

Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

Language: Python - Size: 48.1 MB - Last synced at: 21 days ago - Pushed at: about 5 years ago - Stars: 782 - Forks: 272

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language: Python - Size: 3.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 514 - Forks: 88

cvqluu/Angular-Penalty-Softmax-Losses-Pytorch

Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

Language: Python - Size: 9.35 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 487 - Forks: 93

taylorlu/Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language: Python - Size: 52.6 MB - Last synced at: 11 months ago - Pushed at: almost 4 years ago - Stars: 453 - Forks: 124

TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

Language: Python - Size: 60.1 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 440 - Forks: 97

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Language: Python - Size: 175 MB - Last synced at: 6 days ago - Pushed at: 25 days ago - Stars: 411 - Forks: 40

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

Language: Python - Size: 32.7 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 403 - Forks: 21

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language: HTML - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 364 - Forks: 29

yeyupiaoling/VoiceprintRecognition-Tensorflow

Voiceprint recognition implemented with TensorFlow

Language: Python - Size: 1.01 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 308 - Forks: 67

manojpamk/pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language: Python - Size: 356 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 306 - Forks: 65

yeyupiaoling/VoiceprintRecognition-PaddlePaddle

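This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods including MelSpectrogram, Spectrogram, MFCC, and Fbank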

Language: Python - Size: 5.09 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 262 - Forks: 48

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Language: Tcl - Size: 248 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 257 - Forks: 67

crouchred/speaker-recognition-py3 📦

Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)

Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 242 - Forks: 82

Walleclipse/Deep_Speaker-speaker_recognition_system

Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)

Language: Python - Size: 429 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 235 - Forks: 79

VITA-Group/AutoSpeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

Language: Python - Size: 193 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 208 - Forks: 42

Atul-Anand-Jha/Speaker-Identification-Python

Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library

Language: Python - Size: 11.8 MB - Last synced at: 25 days ago - Pushed at: almost 5 years ago - Stars: 207 - Forks: 76

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

Language: Python - Size: 33.9 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 203 - Forks: 18

jymsuper/SpeakerRecognition_tutorial

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

Language: Python - Size: 678 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 199 - Forks: 44

IBM-Cloud/chatbot-watson-android 📦

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Language: Java - Size: 3.42 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 195 - Forks: 181

cvqluu/TDNN

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Language: Python - Size: 708 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 183 - Forks: 40

oscarknagg/voicemap

Identifying people from small audio fragments

Language: Python - Size: 3.18 MB - Last synced at: 19 days ago - Pushed at: about 5 years ago - Stars: 170 - Forks: 73

lihanghang/CASR-DEMO

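A Flask web-based demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint-based speaker recognition.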

Language: CSS - Size: 97.2 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 161 - Forks: 28

cvqluu/Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language: Python - Size: 278 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 140 - Forks: 34

yeyupiaoling/VoiceprintRecognition-Keras

Voiceprint recognition model implemented with Keras

Language: Python - Size: 1.57 MB - Last synced at: 18 days ago - Pushed at: 7 months ago - Stars: 137 - Forks: 28

jefflai108/pytorch-kaldi-neural-speaker-embeddings

Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.

Language: Perl - Size: 9.35 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 137 - Forks: 34

Anwarvic/Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Language: Python - Size: 22.9 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 110 - Forks: 32

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

Language: Jupyter Notebook - Size: 13 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 92 - Forks: 30

georgygospodinov/speech_course

Deep Learning for Speech

Language: Jupyter Notebook - Size: 35.6 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 90 - Forks: 8

bjfu-ai-institute/speaker-recognition-papers

Share some recent speaker recognition papers and their implementations.

Language: Python - Size: 9.49 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 90 - Forks: 22

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Language: Python - Size: 5.21 MB - Last synced at: 24 days ago - Pushed at: almost 4 years ago - Stars: 89 - Forks: 13

GauravWaghmare/Speaker-Identification

A program for automatic speaker identification using deep learning techniques.

Language: Python - Size: 436 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 83 - Forks: 29

Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

Language: Jupyter Notebook - Size: 254 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 82 - Forks: 15

linhdvu14/vggvox-speaker-identification

Speaker identification with VGGVox network

Language: Python - Size: 62.5 MB - Last synced at: 11 months ago - Pushed at: over 6 years ago - Stars: 82 - Forks: 34

cvqluu/GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 74 - Forks: 13

seongmin-kye/meta-SR

Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

Language: Python - Size: 778 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 73 - Forks: 19

grausof/keras-sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Language: Python - Size: 260 KB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 72 - Forks: 26

VidyasagarMSC/WatBot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Language: Java - Size: 4.82 MB - Last synced at: 3 days ago - Pushed at: over 6 years ago - Stars: 72 - Forks: 53

yuyq96/D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Language: Python - Size: 155 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 70 - Forks: 23

cyrta/voxceleb

mirror of VoxCeleb dataset - a large-scale speaker identification dataset

Language: Shell - Size: 10 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 68 - Forks: 19

TaoRuijie/Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language: Python - Size: 40.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 65 - Forks: 12

zycv/OpenSpeaker

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

Language: C++ - Size: 15.5 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 64 - Forks: 13

Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Language: Python - Size: 1.87 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 63 - Forks: 15

Wadaboa/titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 62 - Forks: 13

hyperion-ml/hyperion

Python toolkit for speech processing

Language: Python - Size: 150 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 61 - Forks: 18

mjpyeon/wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 57 - Forks: 11

Adirockzz95/Piwho

Speaker recognition library based on MARF for raspberry pi and other SBCs.

Language: Python - Size: 1.49 MB - Last synced at: 15 days ago - Pushed at: over 7 years ago - Stars: 56 - Forks: 20

shangeth/SpeakerProfiling

Estimating the Age, Height, and Gender of a speaker with their speech signal.

Language: Python - Size: 195 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 20

andi611/Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

Language: Python - Size: 1.56 MB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 54 - Forks: 12

SuperKogito/Voice-based-speaker-identification

Speaker identification using voice MFCCs and GMM

Language: Python - Size: 105 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

thuiar/MIntRec

MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)

Language: Python - Size: 1.49 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 8

Aurora11111/speaker-recognition-pytorch

Speaker recognition, voiceprint recognition

Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: about 5 years ago - Stars: 51 - Forks: 8

pika-online/AESRC2020

a deep accent recognition network

Language: Python - Size: 105 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 10

ranchlai/awesome-speaker-embedding

A curated list of speaker-embedding, speaker-verification, and speaker-identification resources.

Size: 334 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 5

wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Language: Python - Size: 9.2 MB - Last synced at: 13 days ago - Pushed at: 12 months ago - Stars: 44 - Forks: 14

maxhollmann/voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

Language: Python - Size: 80.1 KB - Last synced at: 23 days ago - Pushed at: about 4 years ago - Stars: 43 - Forks: 4

Picovoice/falcon

On-device speaker diarization powered by deep learning

Language: Python - Size: 20.2 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 4

swshon/voxceleb-ivector

Voxceleb1 i-vector based speaker recognition system

Language: Perl - Size: 2.16 MB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 40 - Forks: 11

bioidiap/bob

Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob

Language: Python - Size: 139 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 39 - Forks: 5

zycv/Speaker-Recognition-Based-on-Deep-Learning-An-Overview

This repo lists the reference papers of "Speaker Recognition Based on Deep Learning: An Overview"

Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 39 - Forks: 5

ttop32/wav2vec2-live-japanese-translator

Real-time Japanese speech recognition translator using wav2vec2

Language: Jupyter Notebook - Size: 926 KB - Last synced at: 20 days ago - Pushed at: almost 3 years ago - Stars: 37 - Forks: 3

PiotrTa/Huawei-Challenge-Speaker-Identification

Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.

Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 36 - Forks: 10

SpeakerGuard/SpeakerGuard

a Pytorch library for security research on speaker recognition, released in "Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition" accepted by TDSC

Language: Python - Size: 507 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 35 - Forks: 10

AdityaDutt/Audio-Classification-Using-Wavelet-Transform

Classifying audio using Wavelet transform and deep learning

Language: Python - Size: 19.8 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 6

cvqluu/nn-similarity-diarization

Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

Language: Python - Size: 347 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 35 - Forks: 11

manthanthakker/speakerIdentificationNeuralNetworks

The speaker recognition system consists of two phases: feature extraction and recognition. In the extraction phase, the speaker's voice is recorded and a number of typical features are extracted to form a model. During the recognition phase, a speech sample is compared against a previously created voiceprint stored in the database. The system can also identify the speaker's voice in a multi-speaker environment. A multi-layer perceptron (MLP) neural network based on the error backpropagation training algorithm was used to train and test the system. The system response time was 74 µs with an average efficiency of 95%.

Language: MATLAB - Size: 2.94 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 34 - Forks: 20

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

Language: Python - Size: 36.3 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 5

mycrazycracy/tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language: Python - Size: 398 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 16

KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding

Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch

Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 8

ZhaZhaFon/resource_speech

A collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link has been failing recently, so an external download link has been added

Size: 61.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 30 - Forks: 6

dydtjr1128/Speaker-Recognition-using-NN

Speaker Recognition using Neural Network & Linear Regression

Language: Jupyter Notebook - Size: 46.8 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 7

swshon/multi-speakerID

Language: Python - Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 13

imranparuk/speaker-recognition-3d-cnn

Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"

Language: Python - Size: 1.06 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 12

exemplaryai/ai-engine

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Size: 5.15 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 25 - Forks: 0

doerlbh/MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Language: Cuda - Size: 998 MB - Last synced at: 11 months ago - Pushed at: over 3 years ago - Stars: 25 - Forks: 5

Abhay0899193/Speaker-Recognition

Speaker Recognition System using MFCC and GMM.

Language: Python - Size: 7.98 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 24 - Forks: 12

deep-privacy/SA-toolkit

SA-toolkit: Speaker speech anonymization toolkit in python

Language: Python - Size: 95 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 23 - Forks: 1

wngh1187/RawNeXt

Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic scaling policies

Language: Python - Size: 179 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 23 - Forks: 0

iPRoBe-lab/1D-Triplet-CNN

PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.

Language: Python - Size: 4.43 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 7

bsxfan/meta-embeddings

Meta-embeddings are a probabilistic generalization of embeddings in machine learning.

Language: Matlab - Size: 15 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 16

SEC4SR/SEC4SR

Source Code for 'SECurity evaluation platform FOR Speaker Recognition' released in 'Defending against Audio Adversarial Examples on Speaker Recognition Systems'

Language: Python - Size: 152 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 15

theolepage/sslsv

Framework for training and evaluating self-supervised learning methods for speaker verification.

Language: Python - Size: 10.6 MB - Last synced at: 26 days ago - Pushed at: 2 months ago - Stars: 21 - Forks: 4

vi7/ecoute-macos Fork of SevaSk/ecoute

Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation.

Language: Python - Size: 128 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 21 - Forks: 3

cvqluu/dropclass_speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Language: Python - Size: 178 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 13

deepaudio/deepaudio-speaker

neural network based speaker embedder

Language: Python - Size: 130 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 20 - Forks: 5

bscharan/Automatic-speech-sequence-segmentation

The main aim of this project is to segment and cluster an audio sample by speaker when the number of speakers is not known beforehand. The main challenge in speaker recognition is separating audio by speaker. Doing so can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns and, when used together with speaker recognition systems, by providing the speaker's true identity. Other challenges arise when multiple speakers are present at the same instant.

Language: MATLAB - Size: 28.3 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 7

theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

TaoRuijie/Speaker-Recognition-Demo

A ResNet Speaker Recognition&Verification Demo

Language: Python - Size: 40.4 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 6

mycrazycracy/Backends-for-SRE19

This repository will illustrate the use of some different backends on NIST SRE 2019.

Language: Python - Size: 82 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 19 - Forks: 14

alifarazz/September

An offline text-independent speaker recognition system

Language: C - Size: 1.84 MB - Last synced at: 19 days ago - Pushed at: over 7 years ago - Stars: 17 - Forks: 4

luan78zaoha/kaldi-timit-sre-ivector

Develop speaker recognition model based on i-vector using TIMIT database

Language: Shell - Size: 695 KB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 16 - Forks: 11

nuaazs/VAF

Backend of an anti-fraud system based on speaker identification (voiceprint recognition) technology.

Language: Python - Size: 274 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 1

zabir-nabil/awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.

Size: 21.5 KB - Last synced at: about 17 hours ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

Related Topics
speaker-verification 81 speaker-identification 55 deep-learning 45 speech-recognition 41 pytorch 36 machine-learning 35 asr 35 speaker-diarization 34 audio 32 speech-processing 22 speech-to-text 21 python 20 voice-recognition 20 speech 19 mfcc 17 speaker-embedding 16 tensorflow 15 neural-network 14 audio-processing 14 conversational-ai 12 kaldi 11 voxceleb 11 neural-networks 10 cnn 9 gmm 9 signal-processing 9 keras 9 voice 8 machine-translation 8 deep-neural-networks 8 diarization 8 voice-activity-detection 7 self-supervised-learning 7 ecapa-tdnn 7 python3 6 matlab 6 artificial-intelligence 6 speaker 6 arcface 5 nlp 5 java 5 classification 5 resnet 5 feature-extraction 5 representation-learning 5 convolutional-neural-networks 5 i-vector 5 voiceprint 5 call-center 5 digital-signal-processing 5 speech-analysis 5 android 5 automatic-speech-recognition 4 chatbot 4 unsupervised-learning 4 tts 4 adversarial-attacks 4 lstm 4 pyaudio 4 calibration 4 metric-learning 4 speaker-embeddings 4 stt 4 nvidia 4 mfcc-features 4 speech-synthesis 4 deeplearning 4 natural-language-processing 4 voiceprint-recognition 4 speechrecognition 4 multimodal 3 text-to-speech 3 face-recognition 3 speaker-recognition-systems 3 vggvox 3 sre 3 gmm-ubm 3 transcription 3 timit 3 voice-conversion 3 asv 3 biometrics 3 triplet-loss 3 kaldi-asr 3 d-vectors 3 ai 3 transformer 3 openai 3 tdnn 3 dino 3 dataset 3 librispeech 3 speech-enhancement 3 ctc 3 voice-biometrics 3 speech-emotion-recognition 3 vocal 2 wav2vec2 2 translator 2 speech-separation 2