Topic: "torchaudio"
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python - Size: 9.59 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 35,880 - Forks: 3,895

ujiaqi/MusicRecommend
:star: 本科毕业设计:基于内容的音乐推荐系统设计与开发。使用了Pytorch框架构建训练模型代码,使用Django构建了前后端。
Language: JavaScript - Size: 106 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 147 - Forks: 14

KentoNishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language: Python - Size: 51.8 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 135 - Forks: 12

nipponjo/tts-arabic-pytorch
TTS models for Arabic (Tacotron2, FastPitch)
Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 89 - Forks: 19

evshiron/rocm_lab
Language: Shell - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 41 - Forks: 2

KentoNishi/torch-time-stretch
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Language: Python - Size: 7.16 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 34 - Forks: 3

torchsmoke/Python3-Wheels
Wheels for Python 3
Size: 457 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 27 - Forks: 3

PINTO0309/pytorch4raspberrypi
Cross-compilation of PyTorch armv7l (32bit) for RaspberryPi OS
Language: Dockerfile - Size: 104 KB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 2

LukeSutor/programmatic-pitch
High fidelity music synthesis using diffusion and UnivNet.
Language: Python - Size: 211 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

eonu/torch-fsdd
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
Language: Python - Size: 14.8 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 2

CrispenGari/emotionAI
(😞 😨 😄 😮 😍 😠 😐 🤮) This is a simple DL API that classifies human emotions from audios and text.
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 1

CrispenGari/animal-sound-classification
this is a simple artificial neural network model using deep learning and torch-audio to classify cats and dog sounds.
Language: Jupyter Notebook - Size: 1.85 MB - Last synced at: 28 days ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 2

nipponjo/tts-german-pytorch
TTS (FastPitch) for German
Language: Python - Size: 53.7 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 0

pradeepbatchu/speechtotext
Speech to Text with Wav2Vec2 using torchaudio
Language: Python - Size: 533 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

glefundes/misophonia-bot
🤖 Telegram bot powered by Deep Learning. Automatically assesses the safety of audios and voice messages for people suffering from misophonia.
Language: Python - Size: 546 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

overcrash66/OpenTranslator
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
Language: Python - Size: 7.48 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

LumenPallidium/audio_generation
Experiments in neural networks for audio generation.
Language: Python - Size: 682 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

mehdihosseinimoghadam/Signal-Processing
Signal Processing with Python and Librosa
Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

vectominist/Switchboard-WSJ-Utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python3
Language: Python - Size: 4.88 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

aminul-huq/Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
Language: Python - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

SekiroRong/KAN-AutoEncoder
KAE : KAN-based AutoEncoder (AE, VAE, VQ-VAE, RVQ, etc.)
Language: Jupyter Notebook - Size: 2.02 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

yangarbiter/torchaudio-benchmark
TorchAudio: Building Blocks for Audio and Speech Processing
Language: Jupyter Notebook - Size: 182 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

dhpollack/spokenlanguages 📦
Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 5

PhilipAmadasun/SER-Model-for-dimensional-attribute-prediction
Speaker Emotion Recognition model for multi-attribute prediction
Language: Python - Size: 156 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

achgls/music-genre-classification
Music genre classification project as part of the Numerical Analysis for Machine Learning course at Politecnico di Milano, A.Y 2022-2023.
Language: Python - Size: 7.63 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

nhassl3/Detect-russian-road-signs
The road sign recognition system of the Russian Federation, which uses an already prepared model for object detection and image segmentation in real time to improve road safety
Language: Python - Size: 15.6 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

CrispenGari/HBSC
🩺♥ Heart Beat Sound Classification (HBSC) is a GraphQL API for classifying heart beats sounds in real time.
Language: Python - Size: 2.25 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 2

escolanogui/Speech_Emotion_Recognition_Real_Time_TFM
Language: Python - Size: 45.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

CrispenGari/torch-audio
🎶🎼 This repository contains some notebooks that were used to train Audio Classification models in pytorch using torchaudio.
Language: Jupyter Notebook - Size: 4.03 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

manhph2211/Speech-Processing
Building a speaker identification & verification pipeline for Vietnamese voices :sleepy:
Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

BaoNguyen6742/uv-install-torch
Tutorial to install torch/pytorch with cuda using uv
Language: Python - Size: 1.25 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

d-f/nylon-amt
Automatic music transcription for classical guitar with hierarchical frequency-time transformers and the MAESTRO dataset
Language: Python - Size: 212 KB - Last synced at: 17 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

thekartikeyamishra/VoiceCloner
The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

albinjm/FinSpeech
A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Language: Jupyter Notebook - Size: 188 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Danand/audio-sample-generator
Generating unique one-shot audio samples with Stable Diffusion.
Language: Python - Size: 53.7 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

aronei44/audio-processing
Repo for all of my ML and DL Audio Processing
Language: Jupyter Notebook - Size: 2.82 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

achgls/music-genre-demo
Streamlit-based demo of our project on Deep Learning for music genre classification as part of the Numerical Analysis for Machine Learning course at Politecnico di Milano, A.Y 2022-2023.
Language: Python - Size: 6.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

chris-santiago/emonet
CNN-LSTM model for audio emotion detection in children with adverse childhood events.
Language: Jupyter Notebook - Size: 28.8 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

LukeSutor/solo_classifier
Convolutional Neural Net trained on over two hours of audio data, capable of differentiating between guitarists playing solos.
Language: Python - Size: 213 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

BakingBrains/Sound_Classification
Sound classification on Urban Sound Dataset
Language: Jupyter Notebook - Size: 3.8 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

AsianZeus/Audio-Classification
Classifying Music Genre with Urban Sound Dataset, Preprocessing with Librosa and Torch audio, Model made in Tensorflow and PyTorch
Language: Jupyter Notebook - Size: 228 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Efenstor/PyTorch-ROCm-gfx1010
Instructions on how to build PyTorch on Debian 12 with support for the AMD gfx1010 architecture
Size: 10.7 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

nipponjo/mixer-tts-pytorch
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

PhilipAmadasun/whisper_torch
An implmenetation of whisper "turbo" model in torch
Language: Python - Size: 744 KB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

ashithapallath/CHiLScope Fork of sonarosa/zero_shot_image
CHiLScope is a smart surveillance system that enhances existing CCTV infrastructure by integrating AI-based emergency detection and activity classification. The system is designed to identify emergencies and unusual activities, including events that were not encountered during training, ensuring real-time situational awareness.
Language: Python - Size: 93.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Amir-Hofo/Speech-commands-Classification
In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.
Language: Jupyter Notebook - Size: 827 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OneLeoTav/voXify
voXify is a Streamlit-powered speech-to-text web application, enabling to generate transcripts from various audio sources and download in PDF or Word format.
Language: Python - Size: 18.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Sujeeth-infosec/Image-object-Detection-and-Recognition
This project leverages Python, computer vision, and deep learning techniques, utilizing pre-trained models such as RetinaNet_ResNet-50 for image-based object detection. It is designed with a primary focus on enhancing security across various sectors. The RetinaNet_ResNet-50 model enables both image and video-based detection functionalities.
Size: 5.05 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

theadamsabra/audio_sleuth
an open-source framework for detecting audio generated from generative systems
Language: Python - Size: 307 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

darylalim/generate-audio-audiocraft
Generate music from text and melody with AudioCraft MusicGen.
Language: Jupyter Notebook - Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lomash-relia/music-transcription
cnn-based model for audio trained on cpu using pytorch
Language: Python - Size: 172 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

darylalim/generate-audio-audiocraft-audiogen
Generate audio from text with AudioCraft AudioGen.
Language: Jupyter Notebook - Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

friskyspock/BirdCallsForestRecording
Audio classification using pytorch
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

capjamesg/taylor-swift
Find how similar your voice is to Taylor Swift (WIP) ✨
Language: Python - Size: 588 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

olga-black/urban_sounds
Code from the ASR tutorial https://towardsdatascience.com/audio-deep-learning-made-simple-sound-classification-step-by-step-cebc936bbe5
Language: Jupyter Notebook - Size: 2.48 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

iTerner/Automatic-Speech-Recognition
Automatic Speech Recognition using torchaudio
Language: Python - Size: 228 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

vitrioil/signal_separation
Signal Separation API
Language: Python - Size: 558 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dsashulya/speech_binary_classifier
Тестовое задание на дипломный проект в Huawei
Language: Python - Size: 791 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

LukeSutor/guitar_source_separation
The unmix model trained to separate guitar playing from audio samples using a custom-built dataset.
Language: Python - Size: 198 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Synthwaver/vocals-extractor 📦
The core of my graduation project that uses convolutional neural networks to extract the vocal part from a song by removing the sound of musical instruments. The project is rather academic, it did not achieve too great real results, but this is expected. I'm not going to develop it further.
Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0
