Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speaker-diarization

NavodPeiris/speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names

Language: Python - Size: 31.3 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 99 - Forks: 6

juanmc2005/diart

A python package to build AI-powered real-time audio applications

Language: Python - Size: 30 MB - Last synced: 10 days ago - Pushed: 5 months ago - Stars: 830 - Forks: 71

Wenhao-Yang/SpeakerVerifiaction-pytorch

Speaker Verification using Pytorch

Language: Jupyter Notebook - Size: 17.7 MB - Last synced: 8 days ago - Pushed: almost 3 years ago - Stars: 9 - Forks: 4

luisst/SpeakerLID_GT_code

Speaker Diarization, Recognition and Language Identification. Scripts to generate GT using our WebApp and Praat software

Language: Python - Size: 41.1 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language: Jupyter Notebook - Size: 114 KB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 2,143 - Forks: 223

linto-ai/linto-diarization

Speaker diarization service

Language: Python - Size: 36.2 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 11 - Forks: 0

wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Size: 175 KB - Last synced: 10 days ago - Pushed: 2 months ago - Stars: 1,473 - Forks: 225

pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook - Size: 250 MB - Last synced: 17 days ago - Pushed: 18 days ago - Stars: 5,154 - Forks: 702

wq2012/SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Language: Python - Size: 1.81 MB - Last synced: 13 days ago - Pushed: 5 months ago - Stars: 491 - Forks: 73

Picovoice/falcon

On-device speaker diarization powered by deep learning

Language: Python - Size: 24.4 MB - Last synced: about 12 hours ago - Pushed: about 1 month ago - Stars: 18 - Forks: 1

7egment/3D-Speaker-Diarization-Pipeline

A simplified and faster version of the speaker diarization pipeline in the 3D-Speaker toolkit by Alibaba DAMO Academy

Language: Python - Size: 30.7 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 1 - Forks: 0

yinruiqing/pyannote-whisper

Language: Python - Size: 3.34 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 420 - Forks: 67

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Language: Python - Size: 171 MB - Last synced: 22 days ago - Pushed: 2 months ago - Stars: 318 - Forks: 37

google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language: Python - Size: 107 MB - Last synced: 22 days ago - Pushed: 9 months ago - Stars: 1,533 - Forks: 318

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

Language: Python - Size: 1.24 MB - Last synced: 26 days ago - Pushed: 26 days ago - Stars: 74 - Forks: 11

alibaba-damo-academy/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。

Language: Python - Size: 95.5 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3,331 - Forks: 389

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

Language: Python - Size: 79.1 KB - Last synced: 17 days ago - Pushed: 9 months ago - Stars: 60 - Forks: 9

alibaba-damo-academy/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python - Size: 2.88 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 689 - Forks: 52

linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language: Python - Size: 4.83 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1,496 - Forks: 129

dptools/WhisperNote

Subtitle generation w/ Speaker Diarization using Whisper and pyannote.audio

Language: Python - Size: 86.9 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 1 - Forks: 1

Joost385/transcription-ui

Full-stack Transcription-UI: Features OpenAI Whisper and NVIDIA NeMo, with Docker for easy deployment.

Language: TypeScript - Size: 353 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

Language: Python - Size: 241 KB - Last synced: 30 days ago - Pushed: 9 months ago - Stars: 68 - Forks: 6

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 84.5 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,821 - Forks: 1,272

nuaazs/VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

Language: Python - Size: 32.7 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 374 - Forks: 17

mathusanm6/Amaze-Voice-Lab

The goal of this research project is to be able to control the movements of characters in a Maze game using real-time voice commands such as saying out loud Up, Down, Left or Right.

Language: Java - Size: 65.8 MB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

wenet-e2e/wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language: Python - Size: 3.4 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 514 - Forks: 88

espnet/espnet

End-to-End Speech Processing Toolkit

Language: Python - Size: 920 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7,825 - Forks: 2,083

e6quisitory/pyannote-benchmark

pyannote.audio benchmark for NVIDIA GPUs

Language: Python - Size: 2.93 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

DrJuChunKoO/TransPal-transcriber

WhisperX Slack bot for transcribing audio files

Language: Python - Size: 40 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

IBM-Cloud/chatbot-watson-android 📦

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Language: Java - Size: 3.42 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 192 - Forks: 180

inferless/pyannote-speaker-diarization-3.1

Pyannote/speaker-diarization-3.1 is an open-source toolkit written in Python for speaker diarization, which is the task of determining "who spoke when" in an audio recording. It is based on the PyTorch machine learning framework and provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized.

Language: Python - Size: 14.6 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 1

DongKeon/Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

Size: 479 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 109 - Forks: 3

VidyasagarMSC/WatBot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Language: Java - Size: 4.82 MB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 72 - Forks: 54

SEERNET/Multi-Speaker-Diarization

Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.

Size: 13.7 KB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 11 - Forks: 0

wq2012/VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

Language: Python - Size: 32.4 MB - Last synced: 10 days ago - Pushed: almost 3 years ago - Stars: 14 - Forks: 3

hitachi-speech/EEND

End-to-End Neural Diarization

Language: Python - Size: 50.6 MB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 342 - Forks: 56

team-re-verb/RE-VERB

speaker diarization system using an LSTM

Language: Python - Size: 135 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 45 - Forks: 8

taylorlu/Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language: Python - Size: 52.6 MB - Last synced: 3 months ago - Pushed: almost 3 years ago - Stars: 439 - Forks: 125

cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Language: Python - Size: 1.27 MB - Last synced: 3 months ago - Pushed: about 1 year ago - Stars: 114 - Forks: 23

kamakaya/gcp-speaker-diarization

Language: Jupyter Notebook - Size: 37.2 MB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 6 - Forks: 1

doerlbh/MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Language: Cuda - Size: 998 MB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 24 - Forks: 5

Audio-WestlakeU/FS-EEND

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Language: Python - Size: 423 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 50 - Forks: 3

manojpamk/pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language: Python - Size: 356 KB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 297 - Forks: 64

yufan-aslp/AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

Language: Python - Size: 492 KB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 107 - Forks: 16

FlorianKrey/DNC

Discriminative Neural Clustering for Speaker Diarisation

Language: Python - Size: 3.62 GB - Last synced: 3 months ago - Pushed: about 2 years ago - Stars: 79 - Forks: 14

j-schmied/RealTimeSpeechRecognition

Various approaches for speech recognition and speaker diarization.

Language: Jupyter Notebook - Size: 2.86 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

cvqluu/Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language: Python - Size: 278 KB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 140 - Forks: 34

theomariotte/SpeakerLoc

Speaker localization algorithms in the meeting context

Language: Python - Size: 332 MB - Last synced: about 1 month ago - Pushed: about 3 years ago - Stars: 2 - Forks: 2

haoming29/ez-transcription

An easy way to make perfect audio transcript with Whisper model and speaker diarization

Language: JavaScript - Size: 1.86 MB - Last synced: 7 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

FrenchKrab/IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Language: Jupyter Notebook - Size: 705 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 28 - Forks: 1

cvqluu/TDNN

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Language: Python - Size: 708 KB - Last synced: 7 months ago - Pushed: over 4 years ago - Stars: 183 - Forks: 40

ryojiysd/speaker-diarization-sample

Sample codes of Google Cloud Speech API's speaker diarization feature

Language: JavaScript - Size: 6.84 KB - Last synced: 8 months ago - Pushed: about 5 years ago - Stars: 2 - Forks: 3

cadia-lvl/kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

Language: Shell - Size: 75.2 KB - Last synced: 2 days ago - Pushed: over 2 years ago - Stars: 12 - Forks: 3

Rajeshshashank/Speaker-Diarization

Speaker Diarization using Python, Flask and Html

Language: HTML - Size: 161 KB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 2

ElmiraGhorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.

Language: Python - Size: 31.3 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

ubclaunchpad/minutes

:telescope: Speaker diarization via transfer learning

Language: Python - Size: 22.2 MB - Last synced: 10 months ago - Pushed: about 5 years ago - Stars: 27 - Forks: 5

terrykwon/lena_evaluation

Code for 'Evaluating the LENA System for Korean' (JSLHR 2021)

Language: Jupyter Notebook - Size: 3.52 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 3 - Forks: 1

Jaswanth-Devarinti/meeting_summerizer

Speaker Diarization + Speech to text + abstract summerization

Language: HTML - Size: 123 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 2 - Forks: 1

StevenLOL/LIUM

Scripts for LIUM SpkDiarization tools

Language: Shell - Size: 26.8 MB - Last synced: 7 months ago - Pushed: almost 7 years ago - Stars: 31 - Forks: 8

htanderson/ITSbin

An R package for working with LENA ITS files at flexible timescales.

Language: R - Size: 2.36 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 4 - Forks: 1

terry-yip/speech-to-text

Speaker diarization and speech to text

Language: Python - Size: 32.2 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 14 - Forks: 0

PranavPutsa1006/Speaker-Diarization

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

Language: Jupyter Notebook - Size: 20.2 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 6 - Forks: 1

Appen/UHV-OTS-Speech 📦

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Language: Forth - Size: 1.41 GB - Last synced: 12 months ago - Pushed: about 1 year ago - Stars: 92 - Forks: 15

vrrao01/speaker_diarization_nmf

A course project for DA 623: Computing with Signals. We investigate the use of Non-negative Matrix Factorization for speaker diarization and source separation.

Language: Python - Size: 1.95 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

yuyq96/D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Language: Python - Size: 155 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 70 - Forks: 23

vishalshar/SpeakerDiarization_RNN_CNN_LSTM

Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).

Language: Python - Size: 3.21 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 57 - Forks: 8

ZhaZhaFon/resource_speech

语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download

Size: 61.5 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 30 - Forks: 6

RishiKakade/Speech-Separating-Hearing-Aid

Language: JavaScript - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

cvqluu/nn-similarity-diarization

Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

Language: Python - Size: 347 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 35 - Forks: 11

cvqluu/GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Language: Python - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 74 - Forks: 13

juanmc2005/CSDA

Companion repository for the paper "Continual Self-supervised Domain Adaptation for End-to-end Speaker Diarization"

Language: Python - Size: 15.9 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 2

bghorvath/fastClusteringDiarizer

Fast clustering of speaker embeddings for multifile speaker diarization with reappearing speakers

Language: Jupyter Notebook - Size: 402 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

joseph9991/SpeakerDiarisation-Python

Speaker Diarisation implemented in Python with the help of IBM Cloud's Watson, which provides a free speech-to-text API

Language: Python - Size: 9.75 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

deepaudio/deepaudio-speaker

neural network based speaker embedder

Language: Python - Size: 130 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 20 - Forks: 5

bghorvath/filmbaradatok

Diarized transcription and insight extraction of 780+hrs of podcast audio data

Language: Jupyter Notebook - Size: 8.05 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

maxhollmann/lium-diarization-editor

A very simple viewer/editor for LIUM speaker diarizations.

Language: Python - Size: 21.5 KB - Last synced: 3 months ago - Pushed: about 3 years ago - Stars: 4 - Forks: 1

juanmc2005/rttm-viewer

Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way

Language: Python - Size: 2.23 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 17 - Forks: 3

Tralfazz/RE-VERB

speaker diarization system using an LSTM

Language: Python - Size: 137 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 22 - Forks: 0

CiscoDevNet/vo-id

Language: Python - Size: 85.7 MB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 10 - Forks: 3

shashikg/X-Vector-Based-Speaker-Diarization

Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also supports spectral and KMeans clustering method.

Language: Jupyter Notebook - Size: 97 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 5 - Forks: 0

acheamponge/Bundy-ML

A Speaker Diarization on Google Cloud machine learning project with Ted Bundy Audio Data

Language: Jupyter Notebook - Size: 1.68 GB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 5 - Forks: 0

FrenchKrab/msdwild-pyannote

Automatically setup the MSDWild dataset for usage with pyannote-database (and pyannote-audio)

Language: Python - Size: 2.93 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

scionoftech/speaker_diarization

speaker diarization using spectralcluster and Deeplearning

Language: Jupyter Notebook - Size: 188 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 0

mmxgn/smooth-convex-kl-nmf

Repository holding various implementation of specific NMF methods for speaker diarization

Language: Python - Size: 17.6 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 4 - Forks: 1

IvanEvan/speaker-diarization

speaker diarization in phone recording/电话录音中的说话人分离

Language: Python - Size: 24.4 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 4 - Forks: 2

JeffT13/rd-diarization

Diarizing Legal Proceedings with d-vectors.

Language: Jupyter Notebook - Size: 33.4 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0

RoyalStorm/speaker-diarization Fork of taylorlu/Speaker-Diarization

🎙️ Speaker Diarization: A System For Solving Cocktail Party Problem

Language: Python - Size: 59.8 MB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 2 - Forks: 1

yinruiqing/annotation_generator

annotation generator for diarization task

Language: Jupyter Notebook - Size: 53.7 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

aditya-joglekar/FS02_Scoring_Toolkit

Scoring Toolkit for the Fearless Steps Challenge Phase-02 Tasks

Language: Python - Size: 6.17 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 6 - Forks: 3

oulfik/spyzer

Speech toolkit for audio analysis, diarization and transcription

Language: Python - Size: 1.66 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

PranavPutsa1006/Deep-Learning

A collection of Deep Learning Programs in Python

Language: Jupyter Notebook - Size: 24.6 MB - Last synced: 12 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

dengchenlong/Unsupervised-Speaker-Clustering-Algorithms-Comparison

无监督说话人聚类算法比较

Language: Shell - Size: 60.5 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

kleinzcy/speech_signal_processing

Language: Python - Size: 15.5 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 15 - Forks: 2

rvarma9604/enc_EEND

Implementation of the paper "End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors" by Shota Horiguchi et al.

Language: Python - Size: 44.9 KB - Last synced: 11 months ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1

eurecom-asp/pyBK Fork of josepatino/pyBK

Our group's submission to the first DIHARD speaker diarization challenge held as a special session in INTERSPEECH '18.

Size: 26.5 MB - Last synced: 12 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

toddstep/birdsong

Machine learning applied to soundscape audio.

Language: Jupyter Notebook - Size: 1.22 MB - Last synced: 4 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

cr7anand/semi-speaker-diarization

Semi Supervised Speaker Diarization with Gaussian Mixture Models

Language: Matlab - Size: 2.48 MB - Last synced: over 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 1

MelekV1/Speaker-Diarization

Speaker diarization simulation built with python

Language: Python - Size: 6.03 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

nikitalpopov/master

research for master degree

Language: Jupyter Notebook - Size: 213 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

cyrta/broadcast-news-videos-dataset

Collection of broadcast news video clips

Size: 3.91 KB - Last synced: about 1 year ago - Pushed: about 7 years ago - Stars: 4 - Forks: 1

Related Keywords
speaker-diarization 100 speaker-recognition 26 speech-recognition 21 speaker-verification 17 pytorch 16 speech-processing 14 machine-learning 14 diarization 13 deep-learning 12 asr 11 whisper 10 speech-to-text 10 speech 10 speaker-identification 8 clustering 7 python 7 kaldi 7 transcription 7 voice-activity-detection 5 end-to-end 5 lstm 5 python3 4 pyannote 4 neural-network 4 speaker-embedding 4 ml 3 speech-separation 3 audio 3 self-supervised-learning 3 source-separation 3 tdnn 3 openai 3 neural-networks 3 ai 3 speaker-diarization-problem 3 dataset 3 dialog 2 conversation-service 2 conversation 2 chatbot 2 android-studio 2 android 2 chainer 2 d-vectors 2 plda 2 java 2 voice-recognition 2 spoken-language-understanding 2 pytorch-lightning 2 speech-enhancement 2 huggingface 2 audio-processing 2 deep-neural-networks 2 transformers 2 x-vector 2 transfer-learning 2 lena 2 interspeech 2 ghostvlad 2 vue 2 reverb 2 redis 2 lium 2 librosa 2 mfcc 2 speech-transcription 2 nmf 2 express 2 cnn 2 docker 2 workspace 2 watson 2 intent 2 ibm-cloud 2 entity 2 uis-rnn 2 automatic-speech-recognition 2 3d-speaker 2 awesome 2 unsupervised-clustering 2 voxceleb 2 supervised-clustering 2 cnceleb 2 speech-activity-detection 2 eres2net 2 meeting-summarization 2 spectral-clustering 2 campplus 2 unsupervised-learning 2 whisperx 2 awesome-list 2 synthetic-speech-detection 1 speech-seperation 1 speech-annotation 1 gender-classification 1 audio-segmentation 1 accent-detection 1 attention-visualization 1 topic-detection 1 d-tdnn 1