torchaudio | Topic | Ecosyste.ms: Repos

Topic: "torchaudio"

2noise/ChatTTS

A generative speech model for daily dialogue.

Language: Python - Size: 9.59 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 35,880 - Forks: 3,895

ujiaqi/MusicRecommend

:star: 本科毕业设计：基于内容的音乐推荐系统设计与开发。使用了Pytorch框架构建训练模型代码，使用Django构建了前后端。

Language: JavaScript - Size: 106 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 147 - Forks: 14

KentoNishi/torch-pitch-shift

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Language: Python - Size: 51.8 MB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 135 - Forks: 12

nipponjo/tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 89 - Forks: 19

evshiron/rocm_lab

Language: Shell - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 41 - Forks: 2

KentoNishi/torch-time-stretch

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Language: Python - Size: 7.16 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 34 - Forks: 3

torchsmoke/Python3-Wheels

Wheels for Python 3

Size: 457 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 27 - Forks: 3

PINTO0309/pytorch4raspberrypi

Cross-compilation of PyTorch armv7l (32bit) for RaspberryPi OS

Language: Dockerfile - Size: 104 KB - Last synced at: 24 days ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 2

LukeSutor/programmatic-pitch

High fidelity music synthesis using diffusion and UnivNet.

Language: Python - Size: 211 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 2

eonu/torch-fsdd

A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.

Language: Python - Size: 14.8 MB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 2

CrispenGari/emotionAI

(😞 😨 😄 😮 😍 😠 😐 🤮) This is a simple DL API that classifies human emotions from audios and text.

Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 1

CrispenGari/animal-sound-classification

this is a simple artificial neural network model using deep learning and torch-audio to classify cats and dog sounds.

Language: Jupyter Notebook - Size: 1.85 MB - Last synced at: 28 days ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 2

nipponjo/tts-german-pytorch

TTS (FastPitch) for German

Language: Python - Size: 53.7 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 0

pradeepbatchu/speechtotext

Speech to Text with Wav2Vec2 using torchaudio

Language: Python - Size: 533 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 1

glefundes/misophonia-bot

🤖 Telegram bot powered by Deep Learning. Automatically assesses the safety of audios and voice messages for people suffering from misophonia.

Language: Python - Size: 546 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

overcrash66/OpenTranslator

Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

Language: Python - Size: 7.48 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

LumenPallidium/audio_generation

Experiments in neural networks for audio generation.

Language: Python - Size: 682 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

mehdihosseinimoghadam/Signal-Processing

Signal Processing with Python and Librosa

Language: Jupyter Notebook - Size: 46.6 MB - Last synced at: 19 days ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

vectominist/Switchboard-WSJ-Utils

Utilities for preprocessing the Switchboard and WSJ corpora in Python3

Language: Python - Size: 4.88 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

aminul-huq/Speech-Command-Classification

Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.

Language: Python - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

SekiroRong/KAN-AutoEncoder

KAE : KAN-based AutoEncoder (AE, VAE, VQ-VAE, RVQ, etc.)

Language: Jupyter Notebook - Size: 2.02 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

yangarbiter/torchaudio-benchmark

TorchAudio: Building Blocks for Audio and Speech Processing

Language: Jupyter Notebook - Size: 182 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

dhpollack/spokenlanguages 📦

Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 5

PhilipAmadasun/SER-Model-for-dimensional-attribute-prediction

Speaker Emotion Recognition model for multi-attribute prediction

Language: Python - Size: 156 KB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

achgls/music-genre-classification

Music genre classification project as part of the Numerical Analysis for Machine Learning course at Politecnico di Milano, A.Y 2022-2023.

Language: Python - Size: 7.63 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

nhassl3/Detect-russian-road-signs

The road sign recognition system of the Russian Federation, which uses an already prepared model for object detection and image segmentation in real time to improve road safety

Language: Python - Size: 15.6 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

CrispenGari/HBSC

🩺♥ Heart Beat Sound Classification (HBSC) is a GraphQL API for classifying heart beats sounds in real time.

Language: Python - Size: 2.25 MB - Last synced at: 20 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 2

escolanogui/Speech_Emotion_Recognition_Real_Time_TFM

Language: Python - Size: 45.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

CrispenGari/torch-audio

🎶🎼 This repository contains some notebooks that were used to train Audio Classification models in pytorch using torchaudio.

Language: Jupyter Notebook - Size: 4.03 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

manhph2211/Speech-Processing

Building a speaker identification & verification pipeline for Vietnamese voices :sleepy:

Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

BaoNguyen6742/uv-install-torch

Tutorial to install torch/pytorch with cuda using uv

Language: Python - Size: 1.25 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

d-f/nylon-amt

Automatic music transcription for classical guitar with hierarchical frequency-time transformers and the MAESTRO dataset

Language: Python - Size: 212 KB - Last synced at: 17 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

thekartikeyamishra/VoiceCloner

The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.

Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

albinjm/FinSpeech

A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models

Language: Jupyter Notebook - Size: 188 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Danand/audio-sample-generator

Generating unique one-shot audio samples with Stable Diffusion.

Language: Python - Size: 53.7 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

aronei44/audio-processing

Repo for all of my ML and DL Audio Processing

Language: Jupyter Notebook - Size: 2.82 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

achgls/music-genre-demo

Streamlit-based demo of our project on Deep Learning for music genre classification as part of the Numerical Analysis for Machine Learning course at Politecnico di Milano, A.Y 2022-2023.

Language: Python - Size: 6.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

chris-santiago/emonet

CNN-LSTM model for audio emotion detection in children with adverse childhood events.

Language: Jupyter Notebook - Size: 28.8 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

LukeSutor/solo_classifier

Convolutional Neural Net trained on over two hours of audio data, capable of differentiating between guitarists playing solos.

Language: Python - Size: 213 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

BakingBrains/Sound_Classification

Sound classification on Urban Sound Dataset

Language: Jupyter Notebook - Size: 3.8 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

AsianZeus/Audio-Classification

Classifying Music Genre with Urban Sound Dataset, Preprocessing with Librosa and Torch audio, Model made in Tensorflow and PyTorch

Language: Jupyter Notebook - Size: 228 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Efenstor/PyTorch-ROCm-gfx1010

Instructions on how to build PyTorch on Debian 12 with support for the AMD gfx1010 architecture

Size: 10.7 KB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

nipponjo/mixer-tts-pytorch

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

PhilipAmadasun/whisper_torch

An implmenetation of whisper "turbo" model in torch

Language: Python - Size: 744 KB - Last synced at: 25 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

ashithapallath/CHiLScope Fork of sonarosa/zero_shot_image

CHiLScope is a smart surveillance system that enhances existing CCTV infrastructure by integrating AI-based emergency detection and activity classification. The system is designed to identify emergencies and unusual activities, including events that were not encountered during training, ensuring real-time situational awareness.

Language: Python - Size: 93.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Amir-Hofo/Speech-commands-Classification

In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.

Language: Jupyter Notebook - Size: 827 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OneLeoTav/voXify

voXify is a Streamlit-powered speech-to-text web application, enabling to generate transcripts from various audio sources and download in PDF or Word format.

Language: Python - Size: 18.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Sujeeth-infosec/Image-object-Detection-and-Recognition

This project leverages Python, computer vision, and deep learning techniques, utilizing pre-trained models such as RetinaNet_ResNet-50 for image-based object detection. It is designed with a primary focus on enhancing security across various sectors. The RetinaNet_ResNet-50 model enables both image and video-based detection functionalities.

Size: 5.05 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

theadamsabra/audio_sleuth

an open-source framework for detecting audio generated from generative systems

Language: Python - Size: 307 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

darylalim/generate-audio-audiocraft

Generate music from text and melody with AudioCraft MusicGen.

Language: Jupyter Notebook - Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lomash-relia/music-transcription

cnn-based model for audio trained on cpu using pytorch

Language: Python - Size: 172 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

darylalim/generate-audio-audiocraft-audiogen

Generate audio from text with AudioCraft AudioGen.

Language: Jupyter Notebook - Size: 7.74 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

friskyspock/BirdCallsForestRecording

Audio classification using pytorch

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

capjamesg/taylor-swift

Find how similar your voice is to Taylor Swift (WIP) ✨

Language: Python - Size: 588 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

olga-black/urban_sounds

Code from the ASR tutorial https://towardsdatascience.com/audio-deep-learning-made-simple-sound-classification-step-by-step-cebc936bbe5

Language: Jupyter Notebook - Size: 2.48 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

iTerner/Automatic-Speech-Recognition

Automatic Speech Recognition using torchaudio

Language: Python - Size: 228 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

vitrioil/signal_separation

Signal Separation API

Language: Python - Size: 558 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dsashulya/speech_binary_classifier

Тестовое задание на дипломный проект в Huawei

Language: Python - Size: 791 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

LukeSutor/guitar_source_separation

The unmix model trained to separate guitar playing from audio samples using a custom-built dataset.

Language: Python - Size: 198 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Synthwaver/vocals-extractor 📦

The core of my graduation project that uses convolutional neural networks to extract the vocal part from a song by removing the sound of musical instruments. The project is rather academic, it did not achieve too great real results, but this is expected. I'm not going to develop it further.

Language: Python - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0