An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: audio-segmentation

nianlonggu/WhisperSeg

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

Language: Python - Size: 243 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 29 - Forks: 9

BingLingGroup/autosub Fork of iWangJiaxiang/autosub

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

Language: Python - Size: 1.29 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 1,986 - Forks: 245

amsehili/auditok

An audio/acoustic activity detection and audio segmentation tool

Language: Python - Size: 3.68 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 769 - Forks: 95

ina-foss/InaGVAD

Voice activity detection and speaker gender segmentation audiovisual corpus

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 1

sushant1827/CrewAI-Agents-MinutesOfMeeting-Gmail

MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.

Language: Python - Size: 28.4 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

nuvita97/music-source-separation

Music Source Separation web application using U-Net model with 2 main features: Audio Separation & Karaoke

Language: Jupyter Notebook - Size: 162 MB - Last synced at: 18 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

boromir674/music-album-creator

Build a digital music library by downloading and segmenting youtube videos.

Language: Python - Size: 12.4 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

radadiavasu/AudioAnalysis

Whole Audio Analysis Research with Python

Language: Python - Size: 86.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

dangrebenkin/wav2vec2_speech_markuper

Automatic generation of speech dataset markup using Wav2Vec2 ASR models

Language: Python - Size: 396 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

mt-upc/SegAugment

SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Language: Python - Size: 136 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

zqlsnr/speech-music-detection

tensorflow for speech-music-detection task,acc 96%+

Language: Python - Size: 2.49 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

huzaifakhan04/music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing

This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.

Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

luuil/Tools

Our Little Tools

Language: Stylus - Size: 28.8 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

0x7o/PyanNet

Training and using audio segmentation

Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

dangrebenkin/speech_audio_separator

A useful tool to split speech WAV PCM files to fragments with use of energy signal minimums (speech pauses).

Language: Python - Size: 444 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dangvansam/pyannote-onnx Fork of pyannote/pyannote-audio

PyAnnote with ONNX model

Language: Jupyter Notebook - Size: 273 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Metiu-Metiu/Neural-Texture-Sound-synthesis---data-sets

Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

Size: 1.5 GB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Appen/UHV-OTS-Speech 📦

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Language: Forth - Size: 1.41 GB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 15

mt-upc/SHAS

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Language: Python - Size: 368 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 24 - Forks: 2

yxlijun/solfege-segmentation

pitch detection,CNN

Language: Python - Size: 16.5 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

LIMUNIMI/labelSignal

Automatic annotation of timbre variation for monophonic musical instruments

Language: MATLAB - Size: 764 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

ElHaban3ro/AsegTool

AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵

Language: JavaScript - Size: 944 KB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Related Keywords
audio-segmentation 22 audio-processing 4 voice-activity-detection 4 python 2 speech-activity-detection 2 music 2 vad 2 audio-classification 2 speech-translation 2 data-augmentation 2 speech-recognition 2 speech-to-text 2 wav2vec2 2 tensorflow 1 svg2png 1 tensorflow-serving 1 savedmodel 1 locust 1 grpc 1 dockerfile 1 docker 1 web-application 1 spotify 1 music-recommendation-system 1 music-recommendation 1 music-information-retrieval 1 machine-learning 1 lsh 1 locality-sensitive-hashing 1 flask-application 1 data-science 1 cosine-distance 1 big-data 1 audio-recommendation 1 approximate-nearest-neighbors 1 ann 1 speech-music-detection 1 speech-processing 1 speech-seperation 1 speech-transcription 1 synthetic-speech-detection 1 topic-detection 1 speech 1 cnn 1 f0-detection 1 solfege-segmentation 1 audio 1 audio-analysis 1 signal-processing 1 sound-and-music-computing 1 timbre 1 video-processing 1 video-segmentation 1 audio-split 1 audio-splitter 1 onnx 1 pyannote 1 speech-separation 1 voice-ac 1 audio-dataset-for-machine-learning 1 audio-datasets 1 real-dataset 1 synthetic-dataset 1 synthetic-dataset-generation 1 accent-detection 1 gender-classification 1 speaker-diarization 1 speaker-identification 1 speech-annotation 1 tv 1 speech-dataset 1 speech-corpus 1 speaker-gender 1 radio 1 gender-representation 1 gender-prediction 1 gender-bias 1 gender 1 dataset 1 corpus 1 benchmark 1 audiovisual-dataset 1 audio-dataset 1 acoustic-diversity 1 voice-detection 1 audio-data 1 audio-activities 1 xunfei-api 1 xfyun 1 subtitles 1 substation-alpha 1 cloud-speech-api 1 baidu-api 1 whisperseg 1 whisper 1 transformer 1 icassp2024 1 animal-sound-detection 1 forced-alignment 1 pyaudio-processing 1