GitHub topics: audio-segmentation
nianlonggu/WhisperSeg
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
Language: Python - Size: 243 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 29 - Forks: 9

BingLingGroup/autosub Fork of iWangJiaxiang/autosub
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
Language: Python - Size: 1.29 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,986 - Forks: 245

amsehili/auditok
An audio/acoustic activity detection and audio segmentation tool
Language: Python - Size: 3.68 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 769 - Forks: 95

ina-foss/InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 1

sushant1827/CrewAI-Agents-MinutesOfMeeting-Gmail
MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.
Language: Python - Size: 28.4 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

nuvita97/music-source-separation
Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke
Language: Jupyter Notebook - Size: 162 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

boromir674/music-album-creator
Build a digital music library by downloading and segmenting youtube videos.
Language: Python - Size: 12.4 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

radadiavasu/AudioAnalysis
Whole Audio Analysis Research with Python
Language: Python - Size: 86.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

dangrebenkin/wav2vec2_speech_markuper
Automatic generation of speech dataset markup using Wav2Vec2 ASR models
Language: Python - Size: 396 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

mt-upc/SegAugment
SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Language: Python - Size: 136 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

zqlsnr/speech-music-detection
tensorflow for speech-music-detection task,acc 96%+
Language: Python - Size: 2.49 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

huzaifakhan04/music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing
This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.
Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

luuil/Tools
Our Little Tools
Language: Stylus - Size: 28.8 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

0x7o/PyanNet
Training and using audio segmentation
Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

dangrebenkin/speech_audio_separator
A useful tool to split speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).
Language: Python - Size: 444 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio
PyAnnote with ONNX model
Language: Jupyter Notebook - Size: 273 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets
Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.
Size: 1.5 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Appen/UHV-OTS-Speech 📦
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Language: Forth - Size: 1.41 GB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 15

mt-upc/SHAS
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Language: Python - Size: 368 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 2

yxlijun/solfege-segmentation
pitch detection,CNN
Language: Python - Size: 16.5 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

LIMUNIMI/labelSignal
Automatic annotation of timbre variation for monophonic musical instruments
Language: MATLAB - Size: 764 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

ElHaban3ro/AsegTool
AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵
Language: JavaScript - Size: 944 KB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
