Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: diarization
swapnil233/qualsearch-nextjs
Comprehensive qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, team management, etc.
Language: TypeScript - Size: 55.9 MB - Last synced: about 22 hours ago - Pushed: about 23 hours ago - Stars: 1 - Forks: 0
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language: Python - Size: 1.23 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 566 - Forks: 24
JSchmie/ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
Language: Python - Size: 5.86 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 14 - Forks: 2
TemporalLabsLLC-SOL/TemporalLabsLLC-YouTubeTranscriber
TemporalLabsLLC YouTube Transcriber is a useful tool designed to convert lists of YouTube videos into text data that can be further distilled for a generative AI pipeline.
Size: 225 KB - Last synced: 6 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0
cadia-lvl/diar-az
Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back
Language: Python - Size: 146 KB - Last synced: 8 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
MedVoice-RMIT-CapStone-2024/MedVoice-FastAPI
Backend for MedVoice Project
Language: Python - Size: 466 KB - Last synced: about 14 hours ago - Pushed: about 15 hours ago - Stars: 0 - Forks: 0
CaioMizerkowski/guaxa
Project for transcription and diarization of a podcast using ML, with a graphical interface for text correction.
Language: Python - Size: 164 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0
R3gm/SoniTranslate
Synchronized Translation for Videos
Language: Python - Size: 18.9 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 45 - Forks: 11
chimechallenge/chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
Language: Python - Size: 2.51 MB - Last synced: 20 days ago - Pushed: 21 days ago - Stars: 13 - Forks: 2
cvqluu/simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Language: Python - Size: 1.27 MB - Last synced: about 12 hours ago - Pushed: about 1 month ago - Stars: 123 - Forks: 26
desh2608/spyder
Simple Python package for fast DER computation
Language: C++ - Size: 98.6 KB - Last synced: about 12 hours ago - Pushed: 11 months ago - Stars: 30 - Forks: 7
desh2608/dover-lap
Python package for combining diarization system outputs.
Language: Python - Size: 1.01 MB - Last synced: about 11 hours ago - Pushed: 8 months ago - Stars: 73 - Forks: 13
limorl/whisper-playground
A playground to use whisper python package for transcription. A dev container is used to set up all that is needed included whisper, pyannote, ffmpeg and pydub.
Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
Language: Python - Size: 79.1 KB - Last synced: about 12 hours ago - Pushed: 9 months ago - Stars: 60 - Forks: 9
KaddaOK/TASMAS
Transcriber and summarizer for file-per-speaker recordings, such as Discord calls recorded by the Craig bot
Language: Python - Size: 19.5 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language: Python - Size: 72.4 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 388 - Forks: 70
e6quisitory/pyannote-benchmark
pyannote.audio benchmark for NVIDIA GPUs
Language: Python - Size: 2.93 KB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
BertilBraun/Meeting-Summarizer
This Project transcribes spoken content into text and identifies distinct speakers, organizing the transcript accordingly for easier review and analysis.
Language: Python - Size: 11.7 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
SEERNET/Multi-Speaker-Diarization
Automated Multi Speaker diarization API for meetings, calls, interviews, press-conference etc.
Size: 13.7 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 11 - Forks: 0
JonasWeinert/LATACA
LocalAutomatedTranscriptionAndContentAnalysis: On device Automatic Speech Recognition & Diarization using fine tuned Whisper small.en as well as Semantic Content Analysis using BART large
Language: Python - Size: 65 MB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 1
adam-aalah/Speech-transcription
Speech transcription and speech diarization
Language: Python - Size: 21.5 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
granludo/diarize_srt
Identifies the diferent sepakers in a recording and labels them on a SRT file.
Language: Python - Size: 24.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
domtoro/whisper-diarization-experiment
On- and off I am experimenting with OpenAI whisper and related technologies. Here I attempt to create a tool that transcribes meeting recordings for me.
Language: Python - Size: 2.93 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
gong-io/gecko
Gecko - A Tool for Effective Annotation of Human Conversations
Language: JavaScript - Size: 51.4 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 248 - Forks: 38
haoming29/ez-transcription
An easy way to make perfect audio transcript with Whisper model and speaker diarization
Language: JavaScript - Size: 1.86 MB - Last synced: 8 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
Language: C - Size: 63.6 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 144 - Forks: 46
cadia-lvl/kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Language: Shell - Size: 75.2 KB - Last synced: 8 days ago - Pushed: over 2 years ago - Stars: 12 - Forks: 3
Rajeshshashank/Speaker-Diarization
Speaker Diarization using Python, Flask and Html
Language: HTML - Size: 161 KB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 2
orianemartin/WhispGrid
A Whisper to TextGrid script that I use to automatize Corpus Annotation on Praat, with speaker diarization.
Language: Python - Size: 35.2 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
mayankkom-dev/11labshackathon
The solution is a POC uitlity of TTS for translating a movie into English using provided subtitles and voice analytics, cloning and TTS.
Language: Java - Size: 44.8 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0
theshajha/whisper-realtime-speech-to-text-summary
Transcribe real-world speech with an API call. Based on Whisper(ASR by OpenAI) - https://openai.com/blog/whisper/
Language: Python - Size: 11.4 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0
ElmiraGhorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
Language: Python - Size: 31.3 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0
Bdata0/call_quality_rate
The app for analyzing Call Quality Rate (CQR) of call transcripts based on audio recordings.
Language: Python - Size: 23.4 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
anonymous-demos/Multimodal-All-In-one-deprecated
Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once
Size: 24.4 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
shahruk10/kaldi-tflite
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
Language: Python - Size: 7.62 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 16 - Forks: 4
RishiKakade/Speech-Separating-Hearing-Aid
Language: JavaScript - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
Amatofrancesco99/speech_to_dialogue
This streamlit web-app has been developed in order to obtain starting from a recorded audio-track the correspondent dialogue.
Language: Python - Size: 660 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
cvqluu/nn-similarity-diarization
Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Language: Python - Size: 347 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 35 - Forks: 11
LianaMikael/SpeechDatasets
Large publicly available speech datasets
Size: 2.93 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0
swapnil233/QualSearch
A web platform for UX researchers to easily analyze user interviews
Language: TypeScript - Size: 550 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
exemplaryai/ai-engine
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Size: 5.15 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 25 - Forks: 0
EdoardoPona/Ara
Ara (think parrot :parrot: ) is a script / api to transcribe and diarise audio. It uses Whisper and Pyannote
Language: Python - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
slegroux/slgKaldi
Resources for easily building ASR systems with Kaldi
Language: Shell - Size: 2.55 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0
DonBraulio/SpeechEmbeddings
Research on speech processing, speaker identification and audio diarization
Language: Jupyter Notebook - Size: 5.82 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
oulfik/spyzer
Speech toolkit for audio analysis, diarization and transcription
Language: Python - Size: 1.66 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
dengchenlong/Unsupervised-Speaker-Clustering-Algorithms-Comparison
无监督说话人聚类算法比较
Language: Shell - Size: 60.5 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
JMasr/atenea
Atenea is a Rich-Transcription Toolkit focused on automatic subtitling.
Language: Shell - Size: 3.85 MB - Last synced: 12 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0
CPqD/trd-sdk-python
Kit de desenvolvimento de aplicações em Python para o CPQD Transcrição de Diálogos
Language: Python - Size: 59.6 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1
keptsecret/Speaker-Diarization Fork of taylorlu/Speaker-Diarization
Fork of the repository by taylorlu, modified for usability and without changing the pretrained models
Language: Python - Size: 52.6 MB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0