GitHub topics: sound-processing
software-mansion/react-native-audio-api
High-performance audio engine
Language: C++ - Size: 134 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 564 - Forks: 33
Ztry8/SoundTool
Simple and fast script to normalize SFX and music for games
Language: Python - Size: 14.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
RaphiMC/AudioMixer
High performance Java audio library
Language: Java - Size: 654 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3 - Forks: 0
rohankanhaisingh/FluexGL-DSP
An open-source, web-based DSP library developed alongside FluexGL, designed for creating and manipulating sounds in various contexts.
Language: TypeScript - Size: 47.9 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0
Severynson/chord-detector
Application that detects guitar chords in real time, powered by deep learning.
Language: Jupyter Notebook - Size: 108 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
matiaszanolli/blastbeats 📦
Media player written in pure Python with sound remastering on the go.
Language: Python - Size: 6.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 9 - Forks: 0
bejranonda/Code-of-the-Sea--performance-control-panel
Interactive Multi-Device Art Installation (2025) 🌊 Turn a Raspberry Pi into an Art Installation Orchestra: Control lights that react to sound, FM radio, live audio mixing & environmental sensors - all from one web dashboard. Battle-tested in gallery exhibitions across 2 continents.
Language: Python - Size: 1.08 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0
EtienneAb3d/karaok-AI
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
Language: Java - Size: 23.4 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 79 - Forks: 3
iver56/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language: Python - Size: 2.28 MB - Last synced at: 13 days ago - Pushed at: 10 months ago - Stars: 1,095 - Forks: 96
Tal0na/Equalizer-Profiles
Custom Equalizer Profiles for Enhanced Audio Quality
Size: 179 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1
bejranonda/Experiment--Heating-DJ
Music-Performance Art (2023) 🔥 What if your body heat could DJ the party? Interactive thermal sensing system that reads temperature changes and movement—then automatically controls music, scratches, tempo. IR camera becomes instrument.
Language: C++ - Size: 141 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0
thedocruby/resounding Fork of vlad2305m/Sound-Physics-Fabric
A new Minecraft mod that provides realistic audio physics using parallel wave tracing and an improved physics algorithm.
Language: Java - Size: 2.34 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 85 - Forks: 4
xasopheno/weresocool
A language for composing microtonal music built in Rust. Make cool sounds. Impress your friends/pets/plants.
Language: Rust - Size: 606 MB - Last synced at: 4 days ago - Pushed at: 28 days ago - Stars: 62 - Forks: 5
MTG/essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
Language: C++ - Size: 299 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3,221 - Forks: 576
miltiadiss/CEID-MSc-Thesis
My Integrated Master of Science Thesis for CEID, University of Patras with Topic: "Supervised domain adaptation techniques for the classification of abnormal respiratory sounds".
Language: Jupyter Notebook - Size: 198 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
Language: Python - Size: 11.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,141 - Forks: 204
datarootsio/fresh-coffee-listener
Using a Raspberry Pi, we listen to the coffee machine and count the number of coffees consumed
Language: Python - Size: 1.4 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 56 - Forks: 2
valeriorlandini/sonus
A collection of various Max/MSP objects for creative patching
Language: C++ - Size: 24.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 1
berndporr/sound_weighting_filters Fork of endolith/waveform-analysis
IIR filter coefficients for sound weighting filters: A, B, C and ITU_R_468
Language: Python - Size: 387 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0
Rishithakallurii/Plants-acoustic-sound-classification
A non-invasive automated system for real-time monitoring of plant health by categorizing plant sounds into stress and damage classes.
Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
satelllte/adsr
Simple synthesizer built with Elementary Audio.
Language: TypeScript - Size: 602 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 2
velipso/sndfilter
Algorithms for sound filters, like reverb, dynamic range compression, lowpass, highpass, notch, etc
Language: C - Size: 74.2 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 473 - Forks: 73
DanielPXL/nitro-fs
NDS Filesystem reading and parsing library
Language: TypeScript - Size: 139 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1
lockepatton/sonipy
Sonification tool for turning scatter plots into perceptually uniform sound files for science and science access.
Language: Python - Size: 10.9 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 7
amp1ee/deranger
Echoic effect well-suited for sound design
Language: C++ - Size: 323 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
LeviBorodenko/spectrographic
Turn an image into sound whose spectrogram looks like the image.
Language: Python - Size: 5.5 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 286 - Forks: 27
sandner-art/VST-Eigensound-Lite
Eigensound Open-Source Test Lab App: Quantum Physics and Music Research
Language: HTML - Size: 10 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
ExaggeratedRumors/demooder
Emotion-recognizing mobile application and model.
Language: Jupyter Notebook - Size: 178 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0
giaourtaki/Digital-Sound-Processing-Project-MIR
[2025][Python][Machine Learning][Sound Processing] Through this project I have gained experience with tools and methodologies for deriving descriptors of audio signals, and with fundamental machine learning (ML) classification algorithms.
Language: Python - Size: 25.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language: Python - Size: 51.8 MB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 137 - Forks: 12
SebaSOFT/walls-have-ears
A simple-as-possible FoundryVTT module to muffle sounds that are behind a wall for a player.
Language: TypeScript - Size: 1.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 5
InboraStudio/WaveBlender-Sound-Transformer-Compute-Unity3D
Synthesizing sound sources for modern physics-based animation on GPU Unity3D
Language: C# - Size: 23.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
Veryzon/qwadro Fork of sigmaco/qwadro
The Qwadro Execution Ecosystem
Language: C - Size: 203 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
sigmaco/qwadro
The Qwadro Execution Ecosystem
Language: C - Size: 165 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1
Veryzon/afx Fork of sigmaco/afx
The Standard Qwadro Implementation
Language: C - Size: 477 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
Yujia-Yan/Transkun
A simple yet effective Audio-to-Midi Automatic Piano Transcription system
Language: Python - Size: 81.7 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 230 - Forks: 22
ObsessiveCompulsiveAudiophile/A1EvoAcoustica
Sound Optimization Tool for Denon & Marantz AV Receivers
Language: HTML - Size: 19.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 148 - Forks: 25
Abish-27/Ghostnote
An app that isolates or extracts chosen instruments/vocals from any song and allows the user to play and download the generated track. Created in a Python/Flask environment using Spleeter for audio analysis.
Language: Python - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1
cidadecentral/LZF-Music
🎵 Build a sleek music player in Flutter, supporting various audio formats and offering features like smart lyric matching and local cloud integration.
Language: Dart - Size: 28.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
michaelkolesidis/javascript-software-synthesizer
JSS-01 | JavaScript Software Synthesizer
Language: TypeScript - Size: 10.8 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 141 - Forks: 11
Squishy47/OpenVerb
Algorithmic reverb VST using FDNs
Language: C++ - Size: 31.3 KB - Last synced at: 28 days ago - Pushed at: about 7 years ago - Stars: 43 - Forks: 2
crlandsc/torch-log-wmse
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.
Language: Python - Size: 408 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 41 - Forks: 1
Jodus-Melodus/rustique
Audio note recognizer
Language: Rust - Size: 399 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
zak-45/SCAnalyzer-Chataigne-Module
Song analyzer with timecoded sequence creation. Executes actions/triggers based on segmenter/rhythm differences. Deeply integrated with WLED and LedFX.
Language: JavaScript - Size: 436 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 15 - Forks: 1
josafatburmeister/BirdSongIdentification
Fully automated machine learning pipeline for bird sound recognition
Language: Jupyter Notebook - Size: 3.77 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 5
csd4ni3l/music-player
A simple and fast music player in Arcade.
Language: Python - Size: 293 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
cloud-py-api/visionatrix
Scalable AI Provider for Nextcloud
Language: Python - Size: 3.79 MB - Last synced at: 9 days ago - Pushed at: 5 months ago - Stars: 10 - Forks: 4
filippov112/sound-to-text
A local Windows application for speech recognition and text input simulation
Language: Python - Size: 22.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
melinteflxrin/ML-Sound-Gestures
Control Spotify to skip or pause songs with sound gestures (like double claps) using machine learning, ensemble models and real-time audio detection.
Language: Python - Size: 155 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
SergeyChelak/harmonicity-plugin
Oscillator-based VST MIDI plugin
Language: Rust - Size: 40 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
hirve/old-radio-sound
A server that emulates old, warm AM radio sound on a Raspberry Pi
Language: C - Size: 4.34 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 32 - Forks: 11
landscape82/awesome-sound-design-resources
Awesome Sound Design resources
Size: 72.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
hisguitar/willow
Side-scrolling RPG game (My first project)
Language: ShaderLab - Size: 69.1 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
jbaudru/SoundStat-soft
SoundStat is a free and open-source desktop application for comprehensive audio analysis and transformation. Built with Electron, it provides audio statistics, waveform visualization, and format conversion tools in an intuitive interface.
Language: HTML - Size: 652 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
miche27/Trigger-Rhythm
Download and manage your audio presets easily with Trigger Rhythm. Get started today and enhance your music production! 🎶✨
Size: 862 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
aureluxx/music_remixer
Remix songs with audio effects and presets.
Language: Python - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
Wb-az/pyspark-mlib-soundlevel-prediction
Creates an ML pipeline leveraging PySpark SQL and PySpark MLlib to predict sound level
Language: Jupyter Notebook - Size: 1.09 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
KentoNishi/torch-time-stretch
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language: Python - Size: 7.16 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 40 - Forks: 3
rabeaifeanyi/acoustic-camera
This project uses Acoular to implement an acoustic camera for the miniDSP UMA-16 microphone array, with optional integration of transformer model results for enhanced audio analysis.
Language: Python - Size: 14.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0
lpestov/interactive_ai_model_builder
Smart builder for interaction with AI and ML
Language: Jupyter Notebook - Size: 207 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 24 - Forks: 1
alec-shalashou/pywavchopper
Chops large WAV files into chunks based on loudness level drops (silence gets cut out). Performs well on large files. Might be useful for chopping a band rehearsal into songs or interviews into chunks.
Language: Python - Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0
michaelkolesidis/javascript-software-synthesizer-classic
JSS-01C | JavaScript Software Synthesizer Classic | The original version of the JavaScript Software Synthesizer before the ongoing refactoring/redesign.
Language: TypeScript - Size: 5.78 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0
henocn/media-refiner
Python package for improving and customizing the quality of media, notably audio, images, and especially video.
Language: Python - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0
wattai/sound-source-position-estimation
These scripts estimate sound source position based on Cross-power Spectrum Phase (CSP) or Multiple Signal Classification (MUSIC).
Language: Python - Size: 39.1 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 7 - Forks: 0
RhythrosaLabs/soundstorm
Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.
Language: Python - Size: 3.39 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 7
EtienneAb3d/WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
Language: Python - Size: 12.1 MB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 326 - Forks: 25
Kermalis/VGMusicStudio
🎵 A program that lets you listen to the music from popular video game formats. 🎵
Language: C# - Size: 4.59 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 276 - Forks: 33
patrick-mns/AudioSS
Audio§ is an open-source tool written in C for efficient silence detection in .wav audio files, optimized for speed and accuracy. Only 16-bit PCM is supported at the moment.
Language: C - Size: 163 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
cmescobar/Lung_heart_source_separation
Repository containing the codes used for the development of a source separation system for cardiorespiratory sounds using Non-negative Matrix Factorisation (NMF). The aim is to obtain a pure respiratory sound and a pure cardiac sound using this algorithm.
Language: Python - Size: 22.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 8 - Forks: 2
WilliamVenner/ffaudio2json
Convert audio files to JSON waveforms using FFmpeg
Language: Rust - Size: 51 MB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 2
RecursiveVoid/pixeltonejs
PixelToneJS is a JavaScript library that converts images into sound based on their RGB data. By interpreting the colors of the image, the library generates corresponding frequencies to create an audio representation of the image.
Language: TypeScript - Size: 119 KB - Last synced at: 27 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0
Fireboltz/Psychic-CCTV
A video analysis tool built completely in python.
Language: Python - Size: 123 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 59 - Forks: 13
mika314/melonix
[WIP] Pitch correction application written using ImGui and OpenGL 3
Language: C++ - Size: 222 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 59 - Forks: 4
dnbsammie/SoundMorph
This repository belongs to "SoundMorph", a VST plugin made with C++ and JUCE
Language: C++ - Size: 784 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
mborik/SAA1099Tracker
SAA1099Tracker is a chiptune music tracker for the Philips SAA 1099 sound chip
Language: TypeScript - Size: 5.1 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 37 - Forks: 3
Squishy47/Circular-Buffer
A Circular Buffer with Cubic and Linear interpolation
Language: C++ - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 1
ddiakopoulos/libnyquist
🎤 Cross platform C++11 library for decoding audio (mp3, wav, ogg, opus, flac, etc)
Language: C++ - Size: 32.9 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 547 - Forks: 68
birdnet-team/BirdNET-V1 📦
Soundscape analysis with BirdNET.
Language: Python - Size: 65.7 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 327 - Forks: 45
giovannibedetti/bach_stress_test
a p5.js experiment with Web Audio API
Language: JavaScript - Size: 5.43 MB - Last synced at: 5 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0
abdozmantar/deepextract
DeepExtract Vocal and Sound Separator
Language: Python - Size: 77.7 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0
SuperKogito/pydiogment
📣 Python library for audio augmentation
Language: Python - Size: 88.7 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 16
alsa-project/tinycompress
The Advanced Linux Sound Architecture (ALSA) - tinycompress
Language: C - Size: 128 KB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 23 - Forks: 25
PetterS/reverb
Calculates the room RT60 reverberation time by sending out tones
Language: Python - Size: 147 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1
saadsalmanakram/Essence-of-Sound
Understanding sound with respect to AI and Data
Size: 60.5 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0
HwlloChen/OfficeGuardian
Keep the office free from noise pollution
Language: Python - Size: 29.3 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
alexadam/img-encode
Encode an image to sound and view it as a spectrogram - turn your images into music
Language: JavaScript - Size: 5.48 MB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 280 - Forks: 46
rednafi/urban-sound-classification
Urban sound source tagging from an aggregation of four-second noisy audio clips via 1D and 2D CNN (Xception)
Language: Jupyter Notebook - Size: 36.9 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 15
mahshid1378/DeepFilterNet2
The Section model sound and run code then you can use for repository coding.
Language: Python - Size: 9.46 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
NonBreathableAir/audiblez
Generate audiobooks from e-books
Size: 1000 Bytes - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0
sdima1357/stm32f401ccAudioNative
stm32 black pill usb sound card
Language: C - Size: 2.22 MB - Last synced at: 7 months ago - Pushed at: almost 2 years ago - Stars: 55 - Forks: 10
zilliz-bootcamp/audio_search
This project uses PANNs for audio tagging and sound event detection to obtain audio embeddings. Milvus is then used to search for similar audio items.
Language: Python - Size: 7.17 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 24 - Forks: 6
Vort/SoundTransceiver
Program for transmitting text messages using sound waves
Language: C# - Size: 343 KB - Last synced at: 4 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1
justin-marian/gabor-bank-filters
Feature extraction, visualizations, and a KNN classifier for sound categorization based on Gabor filters.
Language: Python - Size: 698 KB - Last synced at: 7 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0
adrenak/trill
Send data using sound
Language: C# - Size: 32.2 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 0
katahiromz/cmd_play
PC88 CMD PLAY emulation in C++/Win32
Language: C++ - Size: 2.3 MB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0
malifalhakim/prompt-based-tts-indo
Prompt-based Text-to-Speech system using Parler TTS, designed for generating natural-sounding speech in Indonesian. Includes dataset preparation, model training, inference pipeline, and performance evaluation.
Language: Jupyter Notebook - Size: 534 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0
indiscipline/ffmpeg-loudnorm-helper
Command line helper for performing linear audio loudness normalization using ffmpeg's loudnorm audio filter.
Language: Rust - Size: 32.2 KB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1
datastronaut/lewagon-deepdive-front
Front-end repo for lewagon-deepdive project
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0
michaeldzjap/MDUGens
A collection of various custom SuperCollider (pseudo) UGens
Language: C++ - Size: 64.5 KB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 2
mont29/ubat
Python script conforming audio data for the French Vigie-Chiro bats survey project, working on Linux, using ffmpeg.
Language: Python - Size: 16.6 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0