GitHub topics: audio-processing
stingalleman/awesome-audiovisual
Curated list of audiovisual projects
Size: 227 KB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 203 - Forks: 16

nickmura/AudioPlugin1
Language: C++ - Size: 19.5 KB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

VitorDiToro/YouTube-Audio-Splitter
CLI tool to download YouTube audio and split it into tracks based on timestamps
Language: Python - Size: 7.81 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

carlosedp/morphaweb Fork of Ericxgao/morphaweb-self
Make Noise Morphagene Audio Editor
Language: JavaScript - Size: 8.48 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

dlahmad/sync-nudger
Sync-Nudger is a command-line utility designed for precise audio stream manipulation within video files. It allows you to split an audio track at specific timestamps, apply individual delays to each new segment, and then seamlessly remux the modified audio back into the original container.
Language: Rust - Size: 54.7 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

axeldelafosse/stemgen
🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.
Language: Python - Size: 71.1 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 238 - Forks: 43

bocaletto-luca/Chorus-Audio-Effect
"Chorus" is a Python application developed by Luca Bocaletto that allows you to create the audio effect known as Chorus. This application provides users with the ability to adjust various parameters of the Chorus effect to creatively modify audio. The Chorus effect is widely used in audio production to add depth and spatiality to sound...
Language: Python - Size: 24.4 KB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 0

mbari-org/pbp
Process ocean audio data archives to daily analysis products of hybrid millidecade spectra using PyPAM.
Language: Python - Size: 3.45 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 15 - Forks: 6

julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Language: C - Size: 10.3 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1,891 - Forks: 305

spotify/klio
Smarter data pipelines for audio.
Language: Python - Size: 73.7 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 853 - Forks: 50

ybayle/awesome-deep-learning-music
List of articles related to deep learning applied to music
Language: TeX - Size: 5.87 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 2,889 - Forks: 340

jacob7choi-xyz/harmonyrestorer-v1
🎵 World-class AI audio restoration platform with 1D Operational GANs - Real-time processing, professional noise reduction, and modern React UI
Language: Python - Size: 71.3 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

Ivan-Ayub97/MetroMuse-PyAudioEditor
A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.
Language: Python - Size: 5.63 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 3

LedFx/LedFx
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
Language: Python - Size: 291 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,559 - Forks: 184

alexisvassquez/ai_spotibot_player
AudioMIX is an open-source, AI-driven music production tool designed to empower indie artists with mood-based audio analysis, LED integration, and creative autonomy. Spotibot will be its first test use case.
Language: Python - Size: 1.94 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

obinexus/retrosaga-poc
RetroSaga V1 Trial: MIDI synthesis POC with dynamic cost-function optimization. Demonstrates 8-bit audio processing, real-time MIDI handling, and NexusLink integration for future game engine development.
Language: C - Size: 251 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

Jaded-Encoding-Thaumaturgy/muxtools
Automation package for everything related to encoding and subbing
Language: Python - Size: 2.95 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 21 - Forks: 11

justfollowyourdreams/StMFC
StMFC — Stereo to Mono Fast Converter.
Language: C++ - Size: 2.11 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

Ayush-2404/AUDIORA
🎧 A full-stack Shazam-like music recognition app that identifies songs from short audio clips using FFT-based fingerprinting and real-time matching.
Language: TypeScript - Size: 169 MB - Last synced at: 8 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 1

RaveGeneration/Sonic-Sweep-2
VST Presets for Sonic Sweep 2 (SonicSweep2-Presets.zip) + User Manual
Size: 8.68 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

NonSuperma/RaccoonUtilities
Collection of audio and video editing and downloading scripts.
Language: Python - Size: 261 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

kubakorniluk/audio-visualizer
Audio player/visualizer for my music projects. [Work in progress]
Language: Vue - Size: 105 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

Pudochu/audio-waveform
Waveform Video Generator: Convert audio files to customized waveform videos with captions using Gradio, FFmpeg, and ImageMagick.
Language: Python - Size: 99.6 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 2

rajasekarnp1/neural-audio-upscaler
Advanced neural network-based audio upscaling application that enhances audio quality using deep learning.Cross platform Windows and Mac with gui.
Language: JavaScript - Size: 136 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

shaypower/DawnPro-GUI
A tool for controlling the MoonDrop Dawn Pro DAC/AMP
Language: Python - Size: 43 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 5 - Forks: 1

francesclluis/source-separation-wavenet
A neural network for end-to-end music source separation
Language: Python - Size: 82.2 MB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 227 - Forks: 33

jurihock/remucs
Demucs wrapper for remixing audio files with additional customizations
Language: Python - Size: 87.9 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

anira-project/anira
an architecture for neural network inference in real-time audio applications
Language: C++ - Size: 921 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 148 - Forks: 7

Tracktion/tracktion_engine
Tracktion Engine module
Language: C++ - Size: 1.35 GB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,287 - Forks: 166

jurihock/qdft
Constant-Q Sliding DFT in C++, Rust and Python
Language: Python - Size: 2.48 MB - Last synced at: about 9 hours ago - Pushed at: over 1 year ago - Stars: 36 - Forks: 3

NumberOneBot/dsssp-demo
Demo Project of the DSSSP: React Library of Audio Processing and Visualization
Language: TypeScript - Size: 41.9 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

HANKSOONG/Charisma-Predictor
Multimodal AI pipeline to predict Big Five personality traits and assess charismatic leadership using audio, text, and video inputs.
Language: Jupyter Notebook - Size: 1.91 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

TeamAudio/reaspeech
Speech recognition for REAPER
Language: Lua - Size: 11.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 28 - Forks: 3

mallory-scotton/arcade
🕹️ Retro gaming platform with dynamic library loading for games and graphics. Built as a 2nd-year EPITECH project
Language: C - Size: 80.8 MB - Last synced at: about 18 hours ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

vitalsong/dsplib
C++ DSP library for MATLAB-like coding
Language: C++ - Size: 803 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 24 - Forks: 4

Zaibten/Zaibten-AI-Interviewer
Zaibten AI Interviewer is a cross-platform Mern Stack Web + Flutter application that revolutionizes the hiring process through AI-powered voice interviews, smart analysis, and real-time job scraping using NLP from Indeed and other job portals.
Language: TypeScript - Size: 33.4 MB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

gabyfle/SoundML
A high level DSP library in the OCaml language
Language: OCaml - Size: 179 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 22 - Forks: 0

massimo-rnd/FFMP
⚡A multithreaded C# CLI for digital media processing using FFMPEG. Transcode as many files in parallel as your system can handle.
Language: C# - Size: 56.6 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python - Size: 98.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 9,952 - Forks: 1,504

sevagh/audio-degradation-toolbox
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
Language: Python - Size: 43.9 KB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 50 - Forks: 10

IG-onGit/YouTuPy
By using YouTuPy, you can download entire playlists or specific videos as .mp4 video files or .mp3 audio files.
Language: Python - Size: 47.9 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

JorenSix/Panako
The Panako acoustic fingerprinting system.
Language: Java - Size: 56 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 215 - Forks: 39

neonwatty/bleep-that-shit
Automatically filter, censor, and replace profanity, swear words, curse words, or custom terms in audio and video with a beep or bleep sound effect. Built to self-host with Python, AI, Streamlit, and Docker. Free and open source.
Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 41 - Forks: 6

sergio-sanz-rodriguez/torchsuite
A Comprehensive Pytorch library for Deep Learning Modeling
Language: Python - Size: 102 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

maRce10/warbleR
streamline acoustic analysis in R
Language: R - Size: 180 MB - Last synced at: about 1 hour ago - Pushed at: 13 days ago - Stars: 55 - Forks: 21

communitymedia/mediautilities
An Android library containing common classes and functions used for mediaphone and mediatablet
Language: Java - Size: 2.19 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 9 - Forks: 6

Mach1Studios/m1-spatialsystem
DAW focused plugins and apps relating to mixing Mach1 Spatial multichannel mixes
Language: C++ - Size: 6.12 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

Mach1Studios/m1-panner
GUI and plugin concept for Mach1Encode API
Language: C++ - Size: 1.77 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

HanXinzi-AI/awesome-NLP-resources
a collection of NLP projects&tools. 自然语言处理方向项目和工具集合。
Size: 17.2 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 209 - Forks: 33

Mach1Studios/m1-monitor
GUI and plugin concept for Mach1Decode API
Language: C++ - Size: 621 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

rodgalleUCM/PITEA Fork of Alberto12x/PITEA
PITEA es un proyecto de Trabajo de Fin de Grado enfocado en el diseño y desarrollo de una herramienta capaz de ocultar información sensible dentro de archivos multimedia, aprovechando el uso de técnicas de esteganografía y criptografía.
Language: Python - Size: 175 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Emmet-Hayes/HayesPlugins
A free and open-source bundle of 8 plug-ins, containing an EQ, Compressor, Distortion, PitchShifter, a basic Delay, a tape Delay, an algorithmic Reverb, and a convolution Reverb.
Language: C++ - Size: 30.4 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

kiing-dom/herbie
Deep learning chord recognition model using convolutional neural networks for music information retrieval (MIR). Extracts harmonic content from audio signals.
Language: Python - Size: 12.7 KB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

HEnquist/rawsample
A library for working with raw audio samples
Language: Rust - Size: 67.4 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

peterprospl12/breathing-classification-v2
This repository focuses on the classification of breathing sounds using machine learning techniques. It includes training, validation, and test data for developing and evaluating models.
Language: Python - Size: 856 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 1

ieasybooks/almufarrigh
الواجهة الرسومية الخاصة بأداة تفريغ على أنظمة التشغيل المختلفة
Language: QML - Size: 1.23 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 15 - Forks: 0

daoch4n/promptdj-midi
dynamically steering lyria realtime 48khz stereo music generation with 32 reassignable midi knobs / auto flow / presets / lmm parameters
Language: TypeScript - Size: 434 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 1

Korilakkuma/Web-Music-Documentation
Web Music Documentation for Web Audio API, Web MIDI API ... etc
Size: 21.7 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 3 - Forks: 0

Badri467/DubFlow
DubFlow lets you effortlessly dub YouTube videos into any language with high-quality translations and synced audio. Simply enter a YouTube URL, choose your target language, and get a dubbed video ready to share. Perfect for creators and viewers looking to break language barriers.
Language: JavaScript - Size: 120 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

justanotherinternetguy/XSpeech
XSpeech: A Novel Deep Learning Approach to Classifying Stutters
Language: Jupyter Notebook - Size: 19 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

codeperfectplus/Speak2Summary
Flask/LLM based meeting summarization tool
Language: HTML - Size: 1.8 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 4

gaurav2git/retrosaga-v1trial
RetroSaga V1 Trial showcases an innovative approach to MIDI synthesis, focusing on efficiency and real-time performance. Explore the project on GitHub to see how dynamic cost-function optimization enhances audio processing. 🛠️🎶
Language: C - Size: 253 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Srinidhi-Yoganand/audioRecognition-monorepo
Monorepo to house full stack Audio Recognition App
Language: Java - Size: 52.6 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

RaphiMC/AudioMixer
High performance Java audio mixing library
Language: Java - Size: 579 KB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 0

see2sound/see2sound
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Language: Python - Size: 3.21 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 125 - Forks: 9

jonnor/machinehearing
Machine Learning applied to sound
Language: Jupyter Notebook - Size: 209 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 272 - Forks: 48

makalin/Bitwave
Bitwave is a high-fidelity, developer-friendly, future-proof audio format designed for modern sound experiences — including spatial audio, dynamic tempo adjustment, and multi-track support.
Language: Python - Size: 642 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 0

Georgecane/kite
Kite is a programming language designed for Digital Signal Processing (DSP) and Audio Processing, written in Zig. Its design emphasizes real-time processing, efficiency, and ease of integration with modern audio workflows.
Size: 5.86 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

OpenShot/libopenshot-audio
OpenShot Audio Library (libopenshot-audio) is a free, open-source project that enables high-quality editing and playback of audio, and is based on the amazing JUCE library.
Language: C++ - Size: 7.46 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 268 - Forks: 105

Sharan-Kumar-R/voice-chat-agent
Real-time voice-enabled AI chatbot using Deepgram and Groq LLM for natural conversations.
Language: Python - Size: 57.6 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 3 - Forks: 2

timschneeb/DDCToolbox
Create and edit DDC headset correction files
Language: C++ - Size: 34.3 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 157 - Forks: 15

AUDIY/FIR_x2
FPGA based PCM oversampling FIR filter.
Language: Verilog - Size: 57.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 1

KumudithaSilva/ibm-watson-tts
This project showcases the use of IBM Watson Text to Speech API within a Google Colab notebook. It securely handles API credentials stored in Google Drive, reads input text files, converts text to spoken audio, and saves the resulting MP3 files directly to Google Drive for easy access.
Language: Jupyter Notebook - Size: 6.84 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

alec-shalashou/pywavchopper
Chops large wav files into chunks based on loudness level drops (silence gets cut out). Performs good on large files. Might be useful for chopping band rehearsal into songs or interviews into chunks.
Language: Python - Size: 8.79 KB - Last synced at: 6 days ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

DBraun/DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
Language: C++ - Size: 483 MB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 1,026 - Forks: 79

thirteen-1/Cursor-Ai-Free
Cursor AI Free lets you harness AI tools without any cost. Explore features that enhance your workflow on Windows, macOS, and Linux. 🐙🌟
Size: 10.7 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 25 - Forks: 20

line/lighthouse
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
Language: Python - Size: 33.3 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 147 - Forks: 10

kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX optimization
Language: Python - Size: 377 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 9 - Forks: 2

x86kernel/react-native-superpowered
Implementation of the Superpowered audio engine SDK For React Native
Language: C++ - Size: 36.1 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 7

lakshya1210/Grammar-Scoring-Engine-for-Voice-Samples
Grammar Scoring Engine: An AI-powered tool that automatically assesses spoken English grammar from audio recordings.
Language: Jupyter Notebook - Size: 2.45 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 2 - Forks: 0

shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Language: Python - Size: 5.21 MB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 91 - Forks: 14

JosefLeinweber/ConnectDAWs-VSTPlugin
ConnectDAWs is a VST3 plugin that enables the users to stream audio data between two DAWs with low latency. The goal of ConnectDAWs is to connect two DAWs allowing the users to collaborate in real time where ever they are.
Language: C++ - Size: 5.28 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

loglux/FlexAudioPrint
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file
Language: Python - Size: 167 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 9 - Forks: 0

cjunwon/ODAQ-SDA
Applying Categorical Exploratory Data Analysis (CEDA) methods to study audio quality perception
Language: Python - Size: 950 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Villwin007/Bird-sound-classification-using-CNN
Engineered a robust deep learning model using Convolutional Neural Networks and TensorFlow to classify 114 bird species based on audio recordings. Model achieved an impressive accuracy of 78.75%, providing valuable insights for conservationists and ecologists in the wildlife & ecological research sectors.
Language: Python - Size: 29 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

AIoT-Group-UoP/crossai-ts
An open-source Python library for developing end-to-end AI pipelines for Time Series Analysis, such as Audio, Motion, and other uni- or multi-axes tasks
Language: Python - Size: 22 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 3

Veryzon/qwadro Fork of sigmaco/qwadro
The Qwadro Execution Ecosystem
Language: C - Size: 160 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

Veryzon/afx Fork of sigmaco/afx
The Standard Qwadro Implementation
Language: C - Size: 468 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

zachaa2/ytp-convert
A CLI tool to convert Youtube playlists to one downloaded file
Language: Python - Size: 5.43 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

mltframework/mlt
MLT Multimedia Framework
Language: C - Size: 16.6 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1,587 - Forks: 338

Mirsario/SteamAudio.NET
Auto-generated C# / .NET bindings for Valve's Steam Audio (Phonon)
Language: C# - Size: 17.8 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 23 - Forks: 5

CodeF0x/ffzap
⚡A multithreaded CLI for digital media processing using ffmpeg. If ffmpeg can do it, ffzap can do it - as many files in parallel as your system can handle.
Language: Rust - Size: 92.8 KB - Last synced at: about 5 hours ago - Pushed at: 19 days ago - Stars: 8 - Forks: 1

JoeATlunirq/slicr.me Fork of JoeTreMedia/silent-cut-magic
slicr.me is a fast, parameter-based media cutter that intelligently removes silences and trims unwanted segments from audio or video files. Built for creators, editors, and automation workflows.
Language: TypeScript - Size: 505 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

JoseRuiz01/Detecciones-de-Manipulaciones-Copy-Move-y-Splicing-en-Audio-usando-Tecnicas-de-Aprendizaje-Profundo
Trabajo Fin de Grado sobre Detección de Manipulaciones de Audio usando técnicas Deep Learning.
Language: Jupyter Notebook - Size: 129 MB - Last synced at: 5 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

dofuuz/python-soxr
Fast and high quality sample-rate conversion library for Python
Language: Python - Size: 159 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 92 - Forks: 6

gpustack/vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
Language: Python - Size: 647 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 118 - Forks: 15

vijaykrpp/acapella-extractor
Acapella Extractor Online- Remove vocals, bass & instrumentals from any song. Free vocal remover
Language: PHP - Size: 42 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

DBraun/TD-Faust
FAUST (Functional Audio Stream) for TouchDesigner
Language: C++ - Size: 5.87 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 70 - Forks: 3

reshalfahsi/music-genre-classification
Music Genre Classification using MFCC + ANN
Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

drscotthawley/audio-algebra
alchemy with embeddings
Language: Jupyter Notebook - Size: 178 MB - Last synced at: about 14 hours ago - Pushed at: about 2 years ago - Stars: 34 - Forks: 2
