GitHub topics: audio-segmentation

Repositories

nianlonggu/WhisperSeg

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

Language: Python - Size: 243 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 29 - Forks: 9

BingLingGroup/autosub Fork of iWangJiaxiang/autosub

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

Language: Python - Size: 1.29 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,986 - Forks: 245

amsehili/auditok

An audio/acoustic activity detection and audio segmentation tool

Language: Python - Size: 3.68 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 769 - Forks: 95

ina-foss/InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 1

sushant1827/CrewAI-Agents-MinutesOfMeeting-Gmail

MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.

Language: Python - Size: 28.4 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

nuvita97/music-source-separation

Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke

Language: Jupyter Notebook - Size: 162 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

boromir674/music-album-creator

Build a digital music library by downloading and segmenting youtube videos.

Language: Python - Size: 12.4 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

radadiavasu/AudioAnalysis

Whole Audio Analysis Research with Python

Language: Python - Size: 86.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

dangrebenkin/wav2vec2_speech_markuper

Automatic generation of speech dataset markup using Wav2Vec2 ASR models

Language: Python - Size: 396 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

mt-upc/SegAugment

SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Language: Python - Size: 136 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

zqlsnr/speech-music-detection

tensorflow for speech-music-detection task，acc 96%+

Language: Python - Size: 2.49 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

huzaifakhan04/music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing

This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.

Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

luuil/Tools

Our Little Tools

Language: Stylus - Size: 28.8 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

0x7o/PyanNet

Training and using audio segmentation

Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

dangrebenkin/speech_audio_separator

A useful tool to split speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).

Language: Python - Size: 444 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio

PyAnnote with ONNX model

Language: Jupyter Notebook - Size: 273 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets

Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

Size: 1.5 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0