GitHub topics: diarization
christianebacani/Roadmap
This repository serves as a temporary portfolio showcasing SQL projects and Python scripts related to data engineering, highlighting key accomplishments and implementations.
Language: Python - Size: 1020 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

thewh1teagle/pyannote-rs
pyannote audio diarization in Rust
Language: Rust - Size: 146 KB - Last synced at: about 17 hours ago - Pushed at: 3 months ago - Stars: 73 - Forks: 11

engasd999/senko
⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.
Language: Python - Size: 58 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

SzymiczeQ/zanshin
🎧 Navigate audio content effortlessly with Zanshin, a media player that lets you browse recordings by speaker, supporting both YouTube and local files.
Language: Svelte - Size: 5.01 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Valtora/Nojoin
Nojoin listens to your calls without having a bot join the meeting. With custom tags, speaker attribution, and built-in AI, Nojoin turns your conversations into organised, actionable, and searchable notes. All for free.
Language: Python - Size: 33.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

narcotic-sh/senko
A very fast speaker diarization pipeline
Language: Python - Size: 56.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

thewh1teagle/sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
Language: Rust - Size: 1.49 MB - Last synced at: about 17 hours ago - Pushed at: 4 months ago - Stars: 216 - Forks: 37

empenoso/offline-audio-transcriber
Local and free speech recognition using OpenAI Whisper. Automate the transcription of lectures and meetings on your PC without cloud services or subscriptions.
Language: Python - Size: 26.4 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

mrhallonline/WhisperXTranscription4Researchers
This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).
Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 7 - Forks: 0

HanBnrd/NeMoASR
Automatic speech recognition with speaker diarisation
Language: Python - Size: 22.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

thewh1teagle/loud.cpp
Whisper.cpp with diarization
Language: C++ - Size: 111 KB - Last synced at: about 17 hours ago - Pushed at: 10 months ago - Stars: 15 - Forks: 4

R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language: Python - Size: 19.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,210 - Forks: 270

NotYuSheng/MeetMemo
Record or upload meeting audio, generate diarized transcripts, AI-powered summaries, and export results to PDF. Built with FastAPI, Whisper, and PyAnnote.
Language: JavaScript - Size: 34.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

momentics/CallAnnotate
CallAnnotate is a containerized system for automatic processing of phone conversations, providing speaker diarization, speech recognition, CardDAV integration, transcription, and generation of structured JSON reports. It provides REST and WebSocket APIs for integration.
Language: Python - Size: 1.02 GB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

AathifZahir/WhisprSplit
A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.
Language: Python - Size: 16.6 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

austinwmille/orca
You feed in a video; it outputs context-contained clips resized to 9:16, keeping the speaker in the center.
Language: Python - Size: 495 MB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 6 - Forks: 0

wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
Language: Python - Size: 79.1 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 60 - Forks: 9

microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language: Python - Size: 72.4 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 468 - Forks: 73

Picovoice/falcon
On-device speaker diarization powered by deep learning
Language: Python - Size: 22.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 52 - Forks: 6

jakariaemon/WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
Language: Python - Size: 239 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 0

x2agi/x2agi-speechkit
🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients)
Language: Python - Size: 24.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

JSchmie/ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
Language: Python - Size: 5.91 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 52 - Forks: 15

pulijon/Sttcast
Transcription from mp3 files to html with or without embedded player
Language: Jupyter Notebook - Size: 83.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 5

shivxmr/speech-diarization
Speech Diarization
Language: Jupyter Notebook - Size: 2.06 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

looking-glass-station/intension
Intention is a configuration-driven YouTube/Twitch downloader, sound file diarizer, transcriber, and automatic host matcher and labeler. It also contains auto-topic classifiers and bias detection.
Language: Python - Size: 236 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

LynnLox/web-Scraper-testing
too many unstructured data, alok has too many req, seeing what helps
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Global-Health-Engineering/ghe_transcribe
A Tool to Transcribe Audio Files with Speaker Diarization
Language: Python - Size: 45.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

harmlessman/PAFTS
PAFTS: a library that preprocesses audio for TTS.
Language: Python - Size: 266 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 21 - Forks: 5

TheSeraphim/scribe-forge-ai
🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.
Language: Python - Size: 2.32 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

bunyaminergen/WavLMMSDD
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 7 - Forks: 3

desh2608/dover-lap
Python package for combining diarization system outputs.
Language: Python - Size: 1.01 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 88 - Forks: 12

smwlms/TranscriberApp
Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.
Language: Python - Size: 84.1 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

jeanjerome/EchoInStone
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
Language: Python - Size: 1.35 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 2

eddiegulay/rtrimmer
Python package to trim RTTM diarization files and optionally audio files to a user-specified time range.
Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

shrirajh/subtitled-audio-player
A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface
Language: JavaScript - Size: 111 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 2

mtwn105/audio-intel
AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more
Language: TypeScript - Size: 660 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

transcriptionstream/transcriptionstream
Turnkey self-hosted offline transcription and diarization service with LLM summary
Language: Python - Size: 1.23 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 853 - Forks: 51

Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Size: 214 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2,079 - Forks: 100

SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
Language: C - Size: 63.6 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 171 - Forks: 47

revdotcom/reverb
Open source inference code for Rev's model
Language: Python - Size: 507 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 401 - Forks: 26

cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Language: Python - Size: 1.27 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 149 - Forks: 31

TylerKleinbauer/Personal-Projects
Welcome to my coding playground! This repo showcases my journey as a Machine Learning Engineer, blending AI, data science, and software engineering.
Language: Jupyter Notebook - Size: 68.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

cadia-lvl/kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of Kaldi.
Language: Shell - Size: 82 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

alvaro-francisco-gil/team-performance-study
The code processes data, applies natural language processing (NLP) and audio emotion analysis, and correlates team success metrics with personality types.
Language: Jupyter Notebook - Size: 7.45 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

medvoice-research/MedVoice-Core
MedVoice Application Core System
Language: Python - Size: 2.87 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

desh2608/spyder
Simple Python package for fast DER computation
Language: C++ - Size: 98.6 KB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 33 - Forks: 7

bunyaminergen/Callytics
Callytics is an advanced call analytics solution that leverages speech recognition and large language model (LLM) technologies to analyze phone conversations from customer service and call centers.
Language: Python - Size: 23.9 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 65 - Forks: 10

FGonzalesc/Transcripcion_AI
Audio transcription with Azure Speech and insight extraction with OpenAI
Language: Python - Size: 2.56 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

REDFLAG-bugs/trannote
trannote is a baby project for getting transcription and diarization of speakers.
Language: HTML - Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
Language: Python - Size: 23.3 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

ElmiraGhorbani/gpt-speaker-diarization
Conversational speaker diarization using OpenAI language models (GPT-4) and OpenAI Whisper.
Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
Language: Python - Size: 79.1 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

DKWoods/Diarization
This code takes automated transcription from audio files and attempts to add speaker identifiers. This process is known as diarization.
Language: Python - Size: 4.87 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

itsitgroup/call-analysis-demo
This repository is a user-facing, Streamlit-based frontend for an LLM-powered AI call analysis demo app. It's hosted on streamlit.io. Contact us if you need an API key to test it out: ammar@itsitgroup.com
Language: Python - Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

sohansai/speaker-diarization
A user-friendly interface for identifying and separating speakers in audio files using pyannote.audio and Gradio.
Language: Python - Size: 1000 Bytes - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

TemporalLabsLLC-SOL/TemporalLabsLLC-YouTubeTranscriber
TemporalLabsLLC YouTube Transcriber is a useful tool designed to convert lists of YouTube videos into text data that can be further distilled for a generative AI pipeline.
Size: 225 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

yotwingian/Vocatio-AI
This Streamlit demo app can create a transcription from an audio or video file. It performs speaker diarization of the conversation. Additionally, it can provide a summary or analysis of the conversation.
Language: Python - Size: 19.5 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

jottemka/ds-capstone-speakequal
Speequal, an app for real-time monitoring of conversation-share. Developed in the neuefische Data Science, Machine Learning & AI Bootcamp 2024 in Hamburg.
Language: Jupyter Notebook - Size: 51.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mmaudet/audio-splitter
A Python tool to separate audio files by speaker using diarization data.
Language: Python - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

swapnil233/qualsearch-nextjs
Comprehensive qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, team management, etc.
Language: TypeScript - Size: 58.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Language: Python - Size: 2.63 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 21 - Forks: 3

connortbot/podcast-diarizer
pipeline for speaker diarization using various clustering methods
Language: Jupyter Notebook - Size: 59.8 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

NickNaskida/cog-whisper-diarization (fork of thomasmol/cog-whisper-diarization)
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
Language: Python - Size: 54.7 KB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

a-jain24/Diarization
Facilitates purely text-based diarization labeling of transcripts or other written conversational data using LLMs
Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

NickNaskida/insanely-fast-whisper (fork of chenxwh/insanely-fast-whisper)
Incredibly fast Whisper-large-v3 with speaker diarization
Language: Jupyter Notebook - Size: 396 KB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 1

radadiavasu/AudioAnalysis
Whole Audio Analysis Research with Python
Language: Python - Size: 86.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kmad/speakerspotter
Audio speaker diarization and detection to automatically segment spoken audio.
Language: Python - Size: 43.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Rehan-Ahmad/Speech-Music-Segmentation
This repository contains unsupervised segmentation of audio files consisting of music and speech.
Language: Python - Size: 2.21 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 1

lfenzo/poc-meeting-summarization
Proof of concept implementing multi-speaker recording transcription summarization
Language: Jupyter Notebook - Size: 404 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

b-ashford/MIAMI-Corpus
An English-Spanish code switching dataset adapted from the Miami-Corpus
Size: 2.25 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

flaviodelgrosso/whisper-transcriber
Use OpenAI's Whisper to transcribe audio files and diarize speakers of the transcribed text
Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

cadia-lvl/diar-az
Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back
Language: Python - Size: 146 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

CaioMizerkowski/guaxa
Project for transcription and diarization of a podcast using ML, with a graphical interface for text correction.
Language: Python - Size: 164 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

limorl/whisper-playground
A playground for using the whisper Python package for transcription. A dev container is used to set up everything that is needed, including whisper, pyannote, ffmpeg and pydub.
Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KaddaOK/TASMAS
Transcriber and summarizer for file-per-speaker recordings, such as Discord calls recorded by the Craig bot
Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

e6quisitory/pyannote-benchmark
pyannote.audio benchmark for NVIDIA GPUs
Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BertilBraun/Meeting-Summarizer
This project transcribes spoken content into text and identifies distinct speakers, organizing the transcript accordingly for easier review and analysis.
Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SEERNET/Multi-Speaker-Diarization
Automated multi-speaker diarization API for meetings, calls, interviews, press conferences, etc.
Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 11 - Forks: 0

JonasWeinert/LATACA
LocalAutomatedTranscriptionAndContentAnalysis: on-device automatic speech recognition and diarization using fine-tuned Whisper small.en, as well as semantic content analysis using BART large
Language: Python - Size: 65 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

adam-aalah/Speech-transcription
Speech transcription and speech diarization
Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

EdoardoPona/Ara
Ara (think parrot 🦜) is a script/API to transcribe and diarise audio. It uses Whisper and Pyannote.
Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

granludo/diarize_srt
Identifies the different speakers in a recording and labels them in an SRT file.
Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

domtoro/whisper-diarization-experiment
On and off, I am experimenting with OpenAI Whisper and related technologies. Here I attempt to create a tool that transcribes meeting recordings for me.
Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

gong-io/gecko
Gecko - A Tool for Effective Annotation of Human Conversations
Language: JavaScript - Size: 51.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 248 - Forks: 38

haoming29/ez-transcription
An easy way to make a perfect audio transcript with the Whisper model and speaker diarization
Language: JavaScript - Size: 1.86 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Rajeshshashank/Speaker-Diarization
Speaker Diarization using Python, Flask and HTML
Language: HTML - Size: 161 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

orianemartin/WhispGrid
A Whisper-to-TextGrid script that I use to automate corpus annotation in Praat, with speaker diarization.
Language: Python - Size: 35.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mayankkom-dev/11labshackathon
The solution is a POC utility of TTS for translating a movie into English using provided subtitles and voice analytics, cloning and TTS.
Language: Java - Size: 44.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

theshajha/whisper-realtime-speech-to-text-summary
Transcribe real-world speech with an API call. Based on Whisper (ASR by OpenAI) - https://openai.com/blog/whisper/
Language: Python - Size: 11.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Bdata0/call_quality_rate
The app for analyzing Call Quality Rate (CQR) of call transcripts based on audio recordings.
Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

anonymous-demos/Multimodal-All-In-one-deprecated
Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once
Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

shahruk10/kaldi-tflite
Convert Kaldi feature extraction and nnet3 models into TensorFlow Lite models. Currently aimed at converting Kaldi's x-vector models and diarization pipelines to TensorFlow models.
Language: Python - Size: 7.62 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 4

RishiKakade/Speech-Separating-Hearing-Aid
Language: JavaScript - Size: 10.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

cvqluu/nn-similarity-diarization
Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Language: Python - Size: 347 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 35 - Forks: 11

LianaMikael/SpeechDatasets
Large publicly available speech datasets
Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

swapnil233/QualSearch
A web platform for UX researchers to easily analyze user interviews
Language: TypeScript - Size: 550 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

exemplaryai/ai-engine
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Size: 5.15 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 0

slegroux/slgKaldi
Resources for easily building ASR systems with Kaldi
Language: Shell - Size: 2.55 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

DonBraulio/SpeechEmbeddings
Research on speech processing, speaker identification and audio diarization
Language: Jupyter Notebook - Size: 5.82 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

oulfik/spyzer
Speech toolkit for audio analysis, diarization and transcription
Language: Python - Size: 1.66 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0
