GitHub topics: speaker-identification

Repositories

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 9,268 - Forks: 1,245

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Language: Python - Size: 175 MB - Last synced at: about 23 hours ago - Pushed at: 20 days ago - Stars: 411 - Forks: 40

jakariaemon/WSI

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

Language: Python - Size: 239 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 1

carlosmbe/SpeechDiarizationStarter

Template Project For iOS Apps using .onnx Speech Models

Language: C - Size: 47.4 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

bunyaminergen/awesome-speech-dataset

Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quality speech data covering various domains such as conversational, academic, political, and more.

Size: 113 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

F1uctus/ttc

✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].

Language: Python - Size: 2.6 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 4 - Forks: 0

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language: Python - Size: 78.9 MB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 1,169 - Forks: 265

SiavashShams/ssamba

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Language: Python - Size: 1.88 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 119 - Forks: 9

Warma10032/easytts

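Building the simplest collection of TTS front ends and the simplest audiobook production workflow. Novels are split into sentences with regex rules, and the speakers of dialogue in the novel are identified with RoBERTa, enabling one-click generation of multi-speaker audiobooks. Multi-speaker speech synthesis and high-quality audiobook production.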

Language: Python - Size: 25.3 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 20 - Forks: 5

CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

Language: Python - Size: 17.7 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 56 - Forks: 6

SuperKogito/Voice-based-speaker-identification

Speaker identification using voice MFCCs and GMM

Language: Python - Size: 105 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 15

Picovoice/eagle

On-device speaker recognition engine powered by deep learning

Language: Python - Size: 36.3 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 5

Wadaboa/titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 62 - Forks: 13

nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

Language: Python - Size: 241 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 82 - Forks: 10

musa11971/manhuw

Recognizing and identifying Quran reciters from audio recordings.

Language: Python - Size: 224 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0

sonnygeorge/rick-and-morty-speaker-identification

Demo data science project to take an input utterance and predict which member of the core Rick & Morty family is most likely to say it.

Language: Python - Size: 1010 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

HarryVolek/PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python - Size: 54.7 KB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 582 - Forks: 165

linto-ai/linto-diarization

Speaker diarization service

Language: Python - Size: 37 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 21 - Forks: 1

itmo-mbss-lab/sr_lectures_book

The project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.

Language: TeX - Size: 1.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

oscarknagg/voicemap

Identifying people from small audio fragments

Language: Python - Size: 3.18 MB - Last synced at: 14 days ago - Pushed at: about 5 years ago - Stars: 170 - Forks: 73

Atul-Anand-Jha/Speaker-Identification-Python

Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library

Language: Python - Size: 11.8 MB - Last synced at: 20 days ago - Pushed at: almost 5 years ago - Stars: 207 - Forks: 76

zabir-nabil/awesome-speaker-recognition-verification

A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.

Size: 21.5 KB - Last synced at: 8 days ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 2

CeLuigi/ArabCeleb

ArabCeleb: Speaker Recognition in Arabic

Language: Python - Size: 2.68 MB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 0

PlayVoice/VI-Speaker

Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.

Language: Python - Size: 62.5 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 3

kaistmm/Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Language: Python - Size: 10.7 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 108 - Forks: 13

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Language: HTML - Size: 46.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 364 - Forks: 29

Panjete/speaker-identification

Gentle introduction to Using NeMo's Speaker-Identification capabilities, Analysis across Models and Datasets

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

fabioravila/recognito-csharp

Text Independent Speaker Recognition in CSharp, based on recognito in Java

Language: C# - Size: 53.7 KB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 10 - Forks: 4

jefflai108/pytorch-kaldi-neural-speaker-embeddings

Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.

Language: Perl - Size: 9.35 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 137 - Forks: 34

OwenWaldron/speaker-test

A short test to determine the distribution of similarity scores for different SpeechBrain speaker identification models.

Language: Python - Size: 364 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Anwarvic/Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Language: Python - Size: 22.9 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 110 - Forks: 32

cyrta/voxceleb

mirror of VoxCeleb dataset - a large-scale speaker identification dataset

Language: Shell - Size: 10 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 68 - Forks: 19

a-jain24/Diarization

Facilitates purely text-based diarization labeling of transcripts or other written conversational data using LLMs

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

itmo-mbss-lab/sr_labs_book

The project is related to the development of labs for the ITMO Speaker Recognition Course.

Language: Jupyter Notebook - Size: 3.25 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 8

CiscoDevNet/vo-id

Language: Python - Size: 85.7 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 3

jojojaeger/whisper-streamlit

This master's thesis project is based on OpenAI Whisper, with the goal of transcribing interviews

Language: Python - Size: 44.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 41 - Forks: 15

piedeboer96/Digital-Assistant-Audio-Processing

Project 2.2 - Speech Recognition and Speaker Identification

Language: Java - Size: 13 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

MiCyg/MatrixSpy

Matrix display which recognises a speaker by voice

Language: Python - Size: 57.1 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

mycrazycracy/tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language: Python - Size: 398 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 16

z3lx/speaker-identification

Speaker identification on audio files using the pyannote/embedding model.

Language: Python - Size: 14.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

zabir-nabil/speaker-verification-gmm

Speaker verification using Gaussian Mixture Model (GMM)

Language: Jupyter Notebook - Size: 429 KB - Last synced at: 27 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

aliencaocao/TIL-2023

Champion at Brainhack TIL 2023: Team 10000SGDMRT

Language: Jupyter Notebook - Size: 489 MB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 15 - Forks: 0

KunHanKH/GE2E_Speaker_Verification

Most Complete PyTorch Implementation of "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"

Language: Jupyter Notebook - Size: 193 MB - Last synced at: 12 months ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 2

nmeripo/Text-Independant-Speaker-Identification

Implementation of Google AI's GE2E (PyTorch)

Language: Jupyter Notebook - Size: 41.7 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 1

BiometricVox/DAE_SpeakerID

Denoising autoencoders for speaker identification on MCE 2018 challenge

Language: Python - Size: 10.7 KB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 4

kensonhui/Speaker-Diarization-Sentiment-Analysis

This project performs speech recognition and diarization (speaker identification) on recordings of conversations. This is followed by sentiment analysis of each individual's transcription.

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

pranshurastogi29/uis_rnn_for_speaker_diarization

speaker_diarization done on toy dataset and tested on timit dataset

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 0

NikhilKalloli/Voice-Recognition

A Streamlit web application for Voice recognition using a pre-trained speech embedding model.

Language: PureBasic - Size: 7.45 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rorywilliams36/Active-Speaker-Detection

Language: Python - Size: 76.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

swshon/voxceleb-ivector

Voxceleb1 i-vector based speaker recognition system

Language: Perl - Size: 2.16 MB - Last synced at: 11 months ago - Pushed at: almost 7 years ago - Stars: 40 - Forks: 11

rahulzach/IIIT-Speaker-recognition

This project was done as part of a research teaser project on Speaker Recognition conducted with IIIT Hyderabad.

Language: Jupyter Notebook - Size: 83 KB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Black-Pepper-Team/voice-extractor-svc

Backend service for extracting and processing voice data

Language: Python - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SEERNET/Voice-Prints

Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.

Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 13 - Forks: 3

SEERNET/Multi-Speaker-Diarization

Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.

Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 11 - Forks: 0

TrilokiDA/Speaker-Identification-from-Voice

Language: Jupyter Notebook - Size: 398 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

jymsuper/SpeakerRecognition_tutorial

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

Language: Python - Size: 678 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 199 - Forks: 44

koudounasalkis/Audio-Speech-Tutorial

This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.

Language: Jupyter Notebook - Size: 44.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 1

zabir-nabil/tf2-speaker-recognition

speaker recognition in tensorflow 2

Language: Python - Size: 6.52 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

funcwj/ge2e-speaker-verification

Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

Language: Python - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 98 - Forks: 24

ITE-5th/speaker-recognition

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 5

PiotrTa/Huawei-Challenge-Speaker-Identification

Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.

Language: Jupyter Notebook - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 36 - Forks: 10

manthanthakker/speakerIdentificationNeuralNetworks

⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the speaker's voice is recorded and a typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The highlight of the system is that it can also identify the speaker's voice in a multi-speaker environment. A Multi-layer Perceptron (MLP) neural network based on the error back-propagation training algorithm was used to train and test the system. ⇨ The system response time was 74 µs with an average efficiency of 95%.

Language: MATLAB - Size: 2.94 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 34 - Forks: 20

ShristiShrestha/SincConvBasedSpeakerRecognition

An extension of Sinc CNN implementation for speaker identification on Nepali Dataset.

Language: Python - Size: 77 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

gonza0305/Speaker-identification-using-GMM

Text-independent speaker identification system based on GMM

Language: Jupyter Notebook - Size: 77.1 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

jayaneetha/GenderClassifierLibriSpeech

Gender Classification of the speaker from LibriSpeech Dataset

Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

jpinedaa/Voice-ML

MobileNet trained with VoxCeleb dataset and used for voice verification

Language: Python - Size: 581 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 3

anicolson/SPN-ASI

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

Language: Python - Size: 36.9 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 2

souhaib100/marfspeakeridentapp

Language: Java - Size: 729 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

kasraouiii/Assistant-sourds-et-malentendants

This application uses speech recognition, speech synthesis, and speaker recognition

Language: Java - Size: 5.67 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

MattiaLimone/dnn-hmm

A Deep LSTM-CNN-HMM Neural Network system for Speaker Identification

Language: Python - Size: 435 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

SR-MEiTY/i-SpeakR

Speaker recognition toolkit for Indian languages

Language: Python - Size: 293 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

Appen/UHV-OTS-Speech 📦

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Language: Forth - Size: 1.41 GB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 15

ManuOtel/Speaker-Identification-AI

SDU's project in DNN, Speaker Identification AI using PyTorch.

Language: Python - Size: 410 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vjoki/fsl-experi

Few-shot learning experiments mostly on speaker recognition.

Language: Python - Size: 3.58 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

bashbaha/speakergan

Unofficial implementation of the paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Language: Python - Size: 155 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 4

SEC4SR/SEC4SR

Source Code for 'SECurity evaluation platform FOR Speaker Recognition' released in 'Defending against Audio Adversarial Examples on Speaker Recognition Systems'

Language: Python - Size: 152 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 15

HosseinFayyazi/InterpretableCNN Fork of mravanelli/SincNet

An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task

Language: Python - Size: 80.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

Chaanks/stklia

simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)

Language: Python - Size: 46.6 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

KornelZ/vggvox_identification

Training and evaluation of VGGVox neural network for speaker identification

Language: Python - Size: 654 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding

Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch

Language: Python - Size: 35.2 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 8

mjpyeon/wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 57 - Forks: 11

yudiandoris/csi

End-to-End Chinese Speaker Identification

Language: Python - Size: 24 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

tobiasfshr/gmm-ubm-speaker-identification-verification

Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a Universal Background Model (UBM) on the YOHO dataset in MATLAB.

Language: Matlab - Size: 320 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 22 - Forks: 6

nuaazs/VAF

Backend of an anti-fraud system based on speaker identification technology.

Language: Python - Size: 274 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 1

imranparuk/speaker-recognition-3d-cnn

Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"

Language: Python - Size: 1.06 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 27 - Forks: 12

KrishnaDN/BERTphone

Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"

Language: Python - Size: 923 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 5

SkyDocs/speaker-identification

Speaker Identification using Neural Net.

Language: Python - Size: 91.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 3

cvqluu/GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 74 - Forks: 13

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

Language: Jupyter Notebook - Size: 13 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 92 - Forks: 30

iseesaw/NCSISC-SpeakerRecognition

NCSISC: a voiceprint authentication (ASR) system with replay-attack protection (ASV)

Language: Shell - Size: 328 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

FAKEBOB-adversarial-attack/FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Language: Python - Size: 12.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 24

Schulze18/Who-is-this

A speaker recognition algorithm with GWO and PSO.

Language: Matlab - Size: 28 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 3

cvqluu/dropclass_speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Language: Python - Size: 178 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 13

hsluytergaethje/speaker_identification

This repository contains a pipeline to annotate German raw text with speech, thought and writing instances and their respective speakers. For the identification of the speakers, four sieve systems are used, one for each type of representation: direct, indirect, reported and free indirect.

Language: Python - Size: 5.39 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Related Keywords

speaker-identification 114 speaker-recognition 55 speaker-verification 45 deep-learning 21 pytorch 17 machine-learning 16 speech-processing 14 speech-recognition 14 speaker-diarization 12 speaker-embedding 11 audio 8 python 8 speech 7 audio-processing 7 voice-recognition 7 kaldi 6 deep-neural-networks 6 neural-networks 5 speech-analysis 5 neural-network 5 gaussian-mixture-models 5 speech-to-text 5 asr 5 gmm 5 mfcc 5 audio-classification 4 representation-learning 4 nlp 4 diarization 4 signal-processing 4 tensorflow 4 voice-activity-detection 4 librispeech 3 speaker 3 artificial-intelligence 3 voice 3 d-vectors 3 cnn 3 convolutional-neural-networks 3 nvidia 3 librosa 3 timit 3 android 3 keras 3 speechbrain 3 speech-emotion-recognition 3 gmm-ubm 3 adversarial-attacks 3 decision-theory 2 calibration 2 acoustic-features 2 robotics 2 ge2e 2 voxceleb 2 voice-authentication 2 tutorial 2 scikit-learn 2 tensorflow2 2 classification 2 automatic-speech-recognition 2 domain-adaptation 2 timit-dataset 2 adversarial-defense 2 kaldi-asr 2 embedding-models 2 java 2 i-vector 2 voice-biometrics 2 deep-speaker 2 dataset 2 nemo 2 speech-api 2 mamba 2 keyword-spotting 2 emotion-recognition 2 tts 2 deepspeech 2 text-to-speech 2 raspberry-pi 2 stt 2 speech-enhancement 2 offline 2 state-space-model 2 nvidia-nemo 1 speakergan 1 cgan 1 siamese-neural-network 1 resnet34 1 metric-learning 1 few-shot-learning 1 mfcc-features 1 topic-detection 1 synthetic-speech-detection 1 nvidia-gpu 1 nvidia-cuda 1 translation 1 audio-filter-models 1 teaching-materials 1 neural-machine-translation 1 explainable-ai 1