An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: diarization

christianebacani/Roadmap

This repository serves as a temporary portfolio showcasing SQL projects and Python scripts related to data engineering, highlighting key accomplishments and implementations.

Language: Python - Size: 1020 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

thewh1teagle/pyannote-rs

pyannote audio diarization in Rust

Language: Rust - Size: 146 KB - Last synced at: about 17 hours ago - Pushed at: 3 months ago - Stars: 73 - Forks: 11

engasd999/senko

⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.

Language: Python - Size: 58 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

SzymiczeQ/zanshin

🎧 Navigate audio content effortlessly with Zanshin, a media player that enhances your listening experience by speaker, supporting both YouTube and local files.

Language: Svelte - Size: 5.01 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

Valtora/Nojoin

Nojoin listens to your calls without having a bot join the meeting. With custom tags, speaker attribution, and built-in AI, Nojoin turns your conversations into organised, actionable, and searchable notes. All for free.

Language: Python - Size: 33.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

narcotic-sh/senko

A very fast speaker diarization pipeline

Language: Python - Size: 56.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

thewh1teagle/sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

Language: Rust - Size: 1.49 MB - Last synced at: about 17 hours ago - Pushed at: 4 months ago - Stars: 216 - Forks: 37

empenoso/offline-audio-transcriber

Local and free speech recognition using OpenAI Whisper. Automate the transcription of lectures and meetings on your PC without cloud services or subscriptions.

Language: Python - Size: 26.4 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

mrhallonline/WhisperXTranscription4Researchers

This repository contains a Jupyter notebook for qualitative researchers to transcribe, diarize speakers, and convert audio or video files into various text formats (csv, txt, json, & vtt).

Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 7 - Forks: 0

HanBnrd/NeMoASR

Automatic speech recognition with speaker diarisation

Language: Python - Size: 22.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

thewh1teagle/loud.cpp

Whisper.cpp with diarization

Language: C++ - Size: 111 KB - Last synced at: about 17 hours ago - Pushed at: 10 months ago - Stars: 15 - Forks: 4

R3gm/SoniTranslate

Synchronized Translation for Videos. Video dubbing

Language: Python - Size: 19.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1,210 - Forks: 270

NotYuSheng/MeetMemo

Record or upload meeting audio, generate diarized transcripts, AI-powered summaries, and export results to PDF. Built with FastAPI, Whisper, and PyAnnote.

Language: JavaScript - Size: 34.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

momentics/CallAnnotate

CallAnnotate is a containerized system for automatic processing of phone calls, providing speaker diarization, speech recognition, CardDAV integration, transcription, and generation of structured JSON reports. It exposes REST and WebSocket APIs for integration.

Language: Python - Size: 1.02 GB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

AathifZahir/WhisprSplit

A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.

Language: Python - Size: 16.6 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

austinwmille/orca

You feed in a video; it outputs context-contained clips resized to 9:16, keeping the speaker in the center.

Language: Python - Size: 495 MB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 6 - Forks: 0

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

Language: Python - Size: 79.1 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 60 - Forks: 9

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python - Size: 72.4 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 468 - Forks: 73

Picovoice/falcon

On-device speaker diarization powered by deep learning

Language: Python - Size: 22.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 52 - Forks: 6

jakariaemon/WSI

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

Language: Python - Size: 239 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 0

x2agi/x2agi-speechkit

🎧 X2AGI speech services: ASR, diarization, AI reports (gRPC, REST clients)

Language: Python - Size: 24.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

JSchmie/ScrAIbe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

Language: Python - Size: 5.91 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 52 - Forks: 15

pulijon/Sttcast

Transcription from mp3 files to html with or without embedded player

Language: Jupyter Notebook - Size: 83.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 5

shivxmr/speech-diarization

Speech Diarization

Language: Jupyter Notebook - Size: 2.06 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

looking-glass-station/intension

Intention is a configuration-driven YouTube/Twitch downloader, sound file diarizer, transcriber, and automatic host matcher and labeler. It also contains auto-topic classifiers and bias detection.

Language: Python - Size: 236 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

LynnLox/web-Scraper-testing

too many unstructured data, alok has too many req, seeing what helps

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Global-Health-Engineering/ghe_transcribe

A Tool to Transcribe Audio Files with Speaker Diarization

Language: Python - Size: 45.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

harmlessman/PAFTS

PAFTS: a library that preprocesses audio for TTS.

Language: Python - Size: 266 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 21 - Forks: 5

TheSeraphim/scribe-forge-ai

🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.

Language: Python - Size: 2.32 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

bunyaminergen/WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 7 - Forks: 3

desh2608/dover-lap

Python package for combining diarization system outputs.

Language: Python - Size: 1.01 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 88 - Forks: 12

smwlms/TranscriberApp

Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.

Language: Python - Size: 84.1 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

jeanjerome/EchoInStone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

Language: Python - Size: 1.35 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 2

eddiegulay/rtrimmer

Python package to trim RTTM diarization files and optionally audio files to a user-specified time range.

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

shrirajh/subtitled-audio-player

A sleek, web-based audio player featuring synchronized subtitle display, speaker diarization support, and keyboard controls in a modern, responsive interface

Language: JavaScript - Size: 111 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 2

mtwn105/audio-intel

AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more

Language: TypeScript - Size: 660 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

Language: Python - Size: 1.23 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 853 - Forks: 51

Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

Size: 214 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 2,079 - Forks: 100

SuyashMore/MevonAI-Speech-Emotion-Recognition

Identify the emotion of multiple speakers in an Audio Segment

Language: C - Size: 63.6 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 171 - Forks: 47

revdotcom/reverb

Open source inference code for Rev's model

Language: Python - Size: 507 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 401 - Forks: 26

cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Language: Python - Size: 1.27 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 149 - Forks: 31

TylerKleinbauer/Personal-Projects

Welcome to my coding playground! This repo showcases my journey as a Machine Learning Engineer, blending AI, data science, and software engineering.

Language: Jupyter Notebook - Size: 68.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

cadia-lvl/kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

Language: Shell - Size: 82 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

alvaro-francisco-gil/team-performance-study

The code processes data, applies natural language processing (NLP) and audio emotion analysis, and correlates team success metrics with personality types.

Language: Jupyter Notebook - Size: 7.45 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

medvoice-research/MedVoice-Core

MedVoice Application Core System

Language: Python - Size: 2.87 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

desh2608/spyder

Simple Python package for fast DER computation

Language: C++ - Size: 98.6 KB - Last synced at: 1 day ago - Pushed at: about 2 years ago - Stars: 33 - Forks: 7

bunyaminergen/Callytics

Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.

Language: Python - Size: 23.9 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 65 - Forks: 10

FGonzalesc/Transcripcion_AI

Audio transcription with Azure Speech and insight extraction with OpenAI

Language: Python - Size: 2.56 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

REDFLAG-bugs/trannote

trannote is a baby project for getting transcription and diarization of speakers.

Language: HTML - Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

Language: Python - Size: 23.3 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

ElmiraGhorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI language models (GPT-4) and OpenAI Whisper.

Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

Language: Python - Size: 79.1 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

DKWoods/Diarization

This code takes automated transcription from audio files and attempts to add speaker identifiers. This process is known as diarization.

Language: Python - Size: 4.87 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

itsitgroup/call-analysis-demo

This repository is designed to be a user-facing, Streamlit-based frontend for an LLM-powered AI Call Analysis Demo app. It's hosted on streamlit.io. Contact us if you need an API key to test it out: ammar@itsitgroup.com

Language: Python - Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

sohansai/speaker-diarization

A user-friendly interface for identifying and separating speakers in audio files using pyannote.audio and Gradio.

Language: Python - Size: 1000 Bytes - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

TemporalLabsLLC-SOL/TemporalLabsLLC-YouTubeTranscriber

TemporalLabsLLC YouTube Transcriber is a useful tool designed to convert lists of YouTube videos into text data that can be further distilled for a generative AI pipeline.

Size: 225 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

yotwingian/Vocatio-AI

This streamlit demo app can create a transcription from an audio or video file. It performs speaker diarization of the conversation. Additionally, it can provide a summary or analysis of the conversation.

Language: Python - Size: 19.5 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

jottemka/ds-capstone-speakequal

Speequal, an app for real-time monitoring of conversation-share. Developed in the neuefische Data Science, Machine Learning & AI Bootcamp 2024 in Hamburg.

Language: Jupyter Notebook - Size: 51.6 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mmaudet/audio-splitter

A Python tool to separate audio files by speaker using diarization data.

Language: Python - Size: 2.93 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

swapnil233/qualsearch-nextjs

Comprehensive qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, team management, etc.

Language: TypeScript - Size: 58.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Language: Python - Size: 2.63 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 21 - Forks: 3

connortbot/podcast-diarizer

pipeline for speaker diarization using various clustering methods

Language: Jupyter Notebook - Size: 59.8 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

NickNaskida/cog-whisper-diarization (fork of thomasmol/cog-whisper-diarization)

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

Language: Python - Size: 54.7 KB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

a-jain24/Diarization

Facilitates purely text-based diarization labeling of transcripts or other written conversational data using LLMs

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

NickNaskida/insanely-fast-whisper (fork of chenxwh/insanely-fast-whisper)

Incredibly fast Whisper-large-v3 with speaker diarization

Language: Jupyter Notebook - Size: 396 KB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 1

radadiavasu/AudioAnalysis

Whole Audio Analysis Research with Python

Language: Python - Size: 86.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kmad/speakerspotter

Audio speaker diarization and detection to automatically segment spoken audio.

Language: Python - Size: 43.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Rehan-Ahmad/Speech-Music-Segmentation

This repository contains unsupervised segmentation of audio files consisting of music and speech.

Language: Python - Size: 2.21 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 1

lfenzo/poc-meeting-summarization

Proof of concept implementing multi-speaker recording transcription summarization

Language: Jupyter Notebook - Size: 404 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

b-ashford/MIAMI-Corpus

An English-Spanish code switching dataset adapted from the Miami-Corpus

Size: 2.25 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diarize speakers of the transcribed text

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

cadia-lvl/diar-az

Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back

Language: Python - Size: 146 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

CaioMizerkowski/guaxa

Project for transcription and diarization of a podcast using ML, with a graphical interface for text correction.

Language: Python - Size: 164 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

limorl/whisper-playground

A playground for using the whisper Python package for transcription. A dev container is used to set up everything needed, including whisper, pyannote, ffmpeg and pydub.

Language: Python - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KaddaOK/TASMAS

Transcriber and summarizer for file-per-speaker recordings, such as Discord calls recorded by the Craig bot

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

e6quisitory/pyannote-benchmark

pyannote.audio benchmark for NVIDIA GPUs

Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BertilBraun/Meeting-Summarizer

This Project transcribes spoken content into text and identifies distinct speakers, organizing the transcript accordingly for easier review and analysis.

Language: Python - Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SEERNET/Multi-Speaker-Diarization

Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.

Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 11 - Forks: 0

JonasWeinert/LATACA

LocalAutomatedTranscriptionAndContentAnalysis: On-device Automatic Speech Recognition & Diarization using fine-tuned Whisper small.en as well as Semantic Content Analysis using BART large

Language: Python - Size: 65 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

adam-aalah/Speech-transcription

Speech transcription and speech diarization

Language: Python - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

EdoardoPona/Ara

Ara (think parrot 🦜) is a script/API to transcribe and diarise audio. It uses Whisper and Pyannote.

Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

granludo/diarize_srt

Identifies the different speakers in a recording and labels them in an SRT file.

Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

domtoro/whisper-diarization-experiment

On and off, I am experimenting with OpenAI Whisper and related technologies. Here I attempt to create a tool that transcribes meeting recordings for me.

Language: Python - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

gong-io/gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language: JavaScript - Size: 51.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 248 - Forks: 38

haoming29/ez-transcription

An easy way to make a perfect audio transcript with the Whisper model and speaker diarization

Language: JavaScript - Size: 1.86 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Rajeshshashank/Speaker-Diarization

Speaker Diarization using Python, Flask and Html

Language: HTML - Size: 161 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

orianemartin/WhispGrid

A Whisper-to-TextGrid script that I use to automate corpus annotation in Praat, with speaker diarization.

Language: Python - Size: 35.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mayankkom-dev/11labshackathon

The solution is a POC TTS utility for translating a movie into English using provided subtitles, voice analytics, voice cloning, and TTS.

Language: Java - Size: 44.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

theshajha/whisper-realtime-speech-to-text-summary

Transcribe real-world speech with an API call. Based on Whisper (ASR by OpenAI): https://openai.com/blog/whisper/

Language: Python - Size: 11.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Bdata0/call_quality_rate

The app for analyzing Call Quality Rate (CQR) of call transcripts based on audio recordings.

Language: Python - Size: 23.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

anonymous-demos/Multimodal-All-In-one-deprecated

Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once

Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

shahruk10/kaldi-tflite

Convert Kaldi feature extraction and nnet3 models into TensorFlow Lite models. Currently aimed at converting Kaldi's x-vector models and diarization pipelines to TensorFlow models.

Language: Python - Size: 7.62 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 4

RishiKakade/Speech-Separating-Hearing-Aid

Language: JavaScript - Size: 10.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

cvqluu/nn-similarity-diarization

Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

Language: Python - Size: 347 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 35 - Forks: 11

LianaMikael/SpeechDatasets

Large publicly available speech datasets

Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

swapnil233/QualSearch

A web platform for UX researchers to easily analyze user interviews

Language: TypeScript - Size: 550 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

exemplaryai/ai-engine

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Size: 5.15 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 0

slegroux/slgKaldi

Resources for easily building ASR systems with Kaldi

Language: Shell - Size: 2.55 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

DonBraulio/SpeechEmbeddings

Research on speech processing, speaker identification and audio diarization

Language: Jupyter Notebook - Size: 5.82 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

oulfik/spyzer

Speech toolkit for audio analysis, diarization and transcription

Language: Python - Size: 1.66 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0