Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: diarization

swapnil233/qualsearch-nextjs

Comprehensive qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, team management, etc.

Language: TypeScript - Size: 55.9 MB - Last synced: about 22 hours ago - Pushed: about 23 hours ago - Stars: 1 - Forks: 0

transcriptionstream/transcriptionstream

Turnkey self-hosted offline transcription and diarization service with LLM summary

Language: Python - Size: 1.23 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 566 - Forks: 24

JSchmie/ScrAIbe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

Language: Python - Size: 5.86 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 14 - Forks: 2

TemporalLabsLLC-SOL/TemporalLabsLLC-YouTubeTranscriber

TemporalLabsLLC YouTube Transcriber is a useful tool designed to convert lists of YouTube videos into text data that can be further distilled for a generative AI pipeline.

Size: 225 KB - Last synced: 6 days ago - Pushed: 8 days ago - Stars: 0 - Forks: 0

cadia-lvl/diar-az

Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back

Language: Python - Size: 146 KB - Last synced: 8 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

MedVoice-RMIT-CapStone-2024/MedVoice-FastAPI

Backend for MedVoice Project

Language: Python - Size: 466 KB - Last synced: about 14 hours ago - Pushed: about 15 hours ago - Stars: 0 - Forks: 0

CaioMizerkowski/guaxa

Project for transcription and diarization of a podcast using ML, with a graphical interface for text correction.

Language: Python - Size: 164 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 0 - Forks: 0

R3gm/SoniTranslate

Synchronized Translation for Videos

Language: Python - Size: 18.9 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 45 - Forks: 11

chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Language: Python - Size: 2.51 MB - Last synced: 20 days ago - Pushed: 21 days ago - Stars: 13 - Forks: 2

cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Language: Python - Size: 1.27 MB - Last synced: about 12 hours ago - Pushed: about 1 month ago - Stars: 123 - Forks: 26

desh2608/spyder

Simple Python package for fast DER computation

Language: C++ - Size: 98.6 KB - Last synced: about 12 hours ago - Pushed: 11 months ago - Stars: 30 - Forks: 7

desh2608/dover-lap

Python package for combining diarization system outputs.

Language: Python - Size: 1.01 MB - Last synced: about 11 hours ago - Pushed: 8 months ago - Stars: 73 - Forks: 13

limorl/whisper-playground

A playground for using the whisper Python package for transcription. A dev container sets up everything that is needed, including whisper, pyannote, ffmpeg and pydub.

Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

Language: Python - Size: 79.1 KB - Last synced: about 12 hours ago - Pushed: 9 months ago - Stars: 60 - Forks: 9

KaddaOK/TASMAS

Transcriber and summarizer for file-per-speaker recordings, such as Discord calls recorded by the Craig bot

Language: Python - Size: 19.5 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

microsoft/UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python - Size: 72.4 MB - Last synced: about 2 months ago - Pushed: 2 months ago - Stars: 388 - Forks: 70

e6quisitory/pyannote-benchmark

pyannote.audio benchmark for NVIDIA GPUs

Language: Python - Size: 2.93 KB - Last synced: 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

BertilBraun/Meeting-Summarizer

This project transcribes spoken content into text and identifies distinct speakers, organizing the transcript accordingly for easier review and analysis.

Language: Python - Size: 11.7 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

SEERNET/Multi-Speaker-Diarization

Automated multi-speaker diarization API for meetings, calls, interviews, press conferences, etc.

Size: 13.7 KB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 11 - Forks: 0

JonasWeinert/LATACA

LocalAutomatedTranscriptionAndContentAnalysis: On-device Automatic Speech Recognition & Diarization using fine-tuned Whisper small.en as well as Semantic Content Analysis using BART large

Language: Python - Size: 65 MB - Last synced: about 2 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 1

adam-aalah/Speech-transcription

Speech transcription and speech diarization

Language: Python - Size: 21.5 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0

granludo/diarize_srt

Identifies the different speakers in a recording and labels them in an SRT file.

Language: Python - Size: 24.4 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

domtoro/whisper-diarization-experiment

On and off, I am experimenting with OpenAI whisper and related technologies. Here I attempt to create a tool that transcribes meeting recordings for me.

Language: Python - Size: 2.93 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

gong-io/gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language: JavaScript - Size: 51.4 MB - Last synced: 7 months ago - Pushed: about 1 year ago - Stars: 248 - Forks: 38

haoming29/ez-transcription

An easy way to make a perfect audio transcript with the Whisper model and speaker diarization

Language: JavaScript - Size: 1.86 MB - Last synced: 8 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

SuyashMore/MevonAI-Speech-Emotion-Recognition

Identify the emotion of multiple speakers in an Audio Segment

Language: C - Size: 63.6 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 144 - Forks: 46

cadia-lvl/kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

Language: Shell - Size: 75.2 KB - Last synced: 8 days ago - Pushed: over 2 years ago - Stars: 12 - Forks: 3

Rajeshshashank/Speaker-Diarization

Speaker Diarization using Python, Flask and Html

Language: HTML - Size: 161 KB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 2

orianemartin/WhispGrid

A Whisper to TextGrid script that I use to automate corpus annotation in Praat, with speaker diarization.

Language: Python - Size: 35.2 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

mayankkom-dev/11labshackathon

The solution is a proof-of-concept (POC) utility for translating a movie into English using the provided subtitles together with voice analytics, voice cloning, and TTS.

Language: Java - Size: 44.8 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

theshajha/whisper-realtime-speech-to-text-summary

Transcribe real-world speech with an API call. Based on Whisper (ASR by OpenAI) - https://openai.com/blog/whisper/

Language: Python - Size: 11.4 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 2 - Forks: 0

ElmiraGhorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI language models (gpt-4) and OpenAI Whisper.

Language: Python - Size: 31.3 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

Bdata0/call_quality_rate

The app for analyzing Call Quality Rate (CQR) of call transcripts based on audio recordings.

Language: Python - Size: 23.4 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

anonymous-demos/Multimodal-All-In-one-deprecated

Multi-Modal Speech Recognition, Separation and Diarization, Everything Streaming All at Once

Size: 24.4 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

shahruk10/kaldi-tflite

Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.

Language: Python - Size: 7.62 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 16 - Forks: 4

RishiKakade/Speech-Separating-Hearing-Aid

Language: JavaScript - Size: 10.7 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Amatofrancesco99/speech_to_dialogue

This Streamlit web app takes a recorded audio track and produces the corresponding dialogue.

Language: Python - Size: 660 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

cvqluu/nn-similarity-diarization

Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

Language: Python - Size: 347 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 35 - Forks: 11

LianaMikael/SpeechDatasets

Large publicly available speech datasets

Size: 2.93 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 2 - Forks: 0

swapnil233/QualSearch

A web platform for UX researchers to easily analyze user interviews

Language: TypeScript - Size: 550 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

exemplaryai/ai-engine

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Size: 5.15 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 25 - Forks: 0

EdoardoPona/Ara

Ara (think parrot) is a script / API to transcribe and diarise audio. It uses Whisper and Pyannote.

Language: Python - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

slegroux/slgKaldi

Resources for easily building ASR systems with Kaldi

Language: Shell - Size: 2.55 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

DonBraulio/SpeechEmbeddings

Research on speech processing, speaker identification and audio diarization

Language: Jupyter Notebook - Size: 5.82 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

oulfik/spyzer

Speech toolkit for audio analysis, diarization and transcription

Language: Python - Size: 1.66 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

dengchenlong/Unsupervised-Speaker-Clustering-Algorithms-Comparison

Comparison of unsupervised speaker clustering algorithms

Language: Shell - Size: 60.5 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

JMasr/atenea

Atenea is a Rich-Transcription Toolkit focused on automatic subtitling.

Language: Shell - Size: 3.85 MB - Last synced: 12 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

CPqD/trd-sdk-python

Python application development kit for CPQD Transcrição de Diálogos (dialogue transcription)

Language: Python - Size: 59.6 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 2 - Forks: 1

keptsecret/Speaker-Diarization (fork of taylorlu/Speaker-Diarization)

Fork of the repository by taylorlu, modified for usability and without changing the pretrained models

Language: Python - Size: 52.6 MB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

Related Keywords
diarization (49), transcription (19), speaker-diarization (13), speech-recognition (10), asr (9), whisper (9), speech-to-text (8), python (6), speech-processing (6), speech (5), kaldi (5), pytorch (4), speaker-recognition (4), deep-learning (3), machine-learning (3), speech-separation (3), speech-diarization (2), colab-notebook (2), speechbrain (2), sentiment-analysis (2), gradio (2), audio-processing (2), speaker-identification (2), automatic-speech-recognition (2), audio (2), ai (2), tensorflow (2), openai (2), summarization (2), ux (2), tflite (1), multichannel-microphone-arrays (1), multi-talker (1), separation (1), audio-visual (1), multimodal (1), vosk (1), streamlit (1), metrics (1), audio-files (1), icelandic (1), mfccs (1), plda (1), wav (1), pyaudio (1), alignment (1), praat (1), textgrid (1), tts (1), gpt-4 (1), voice-activity-detection (1), youtube-dl (1), call-quality (1), docker (1), google-sheets (1), natural-language-processing (1), natural-language-understanding (1), neural-networks (1), nlp (1), open-source (1), stt (1), language (1), audio-analysis (1), deep-neural-networks (1), kivy (1), clustering-algorithm (1), unsupervised (1), unsupervised-clustering (1), rich-transcription (1), subtitles (1), cpqd (1), sdk (1), rnn-tensorflow (1), x-vector (1), beamforming (1), source-separation (1), streamlit-application (1), lstm (1), neural-network (1), similarity (1), similarity-score (1), voice (1), voice-cloning (1), voice-separation (1), qualitative-research (1), user-experience (1), automatic-speech-processing (1), conversational-ai (1), language-models (1), low-code (1), multi-provider (1), ensemble-machine-learning (1), dover-lap (1), der (1), speech-enhancement (1), multi-speaker-asr (1), meeting-transcription (1), far-field-speech-recognition (1), translation (1), translate-video (1)