python_speech_features | pypi | Package Usage

pvsnp9/audio_classification_using_deep_learning

This project automatically classifies musical instruments using CNN and RNN

==0.6 requirements.txt

Size: 47.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

bababoss/anticovidrobo

* tflite_speech_recognition/requirements.txt

Size: 24.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

dumengnan/unicorn

==0.6 06source_code/service-center/language-service/requirements.txt

Size: 94.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

mesolitica/malaya-speech

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

* docs/requirements.txt

Size: 552 MB - Last synced: 10 days ago - Pushed: 10 days ago

zhangchenxu528/FACIAL

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

* requirements.txt

Size: 9.23 MB - Last synced: 7 months ago - Pushed: almost 2 years ago

yeyupiaoling/PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。

==0.6 requirements.txt

Size: 14.8 MB - Last synced: 9 days ago - Pushed: 9 days ago

joonson/syncnet_python

Out of time: automated lip sync in the wild

* requirements.txt

Size: 92.8 KB - Last synced: 6 months ago - Pushed: 9 months ago

RoBorregos/robocup-home

Legacy Roborregos @Home division.

* catkin_home/src/action_selectors/scripts/DeepSpeech/requirements.txt

Size: 380 MB - Last synced: 5 days ago - Pushed: about 1 month ago

rhasspy/rhasspy

Offline private voice assistant for many human languages

==0.6 requirements.txt

Size: 11.8 MB - Last synced: 15 days ago - Pushed: 10 months ago

ReneeYe/ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

* requirements.txt

Size: 3.62 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

rtaori/Black-Box-Audio

Targeted Adversarial Examples for Black Box Audio Systems

* requirements.txt

Size: 455 KB - Last synced: about 1 year ago - Pushed: over 3 years ago

sephiroce/srf

Supplementary files for the sequential routing framework

* requirements.txt

Size: 38.1 KB - Last synced: 3 months ago - Pushed: almost 2 years ago

Jason-Oleana/speech-emotion-classification

MFCC features + SVM for speech emotion classification

==0.6 requirements.txt

Size: 428 KB - Last synced: about 1 year ago - Pushed: over 3 years ago

rhasspy/rhasspy-wake-raven

Wake word detection engine based on Snips Personal Wakeword Detector

==0.6 requirements.txt

Size: 6.29 MB - Last synced: 25 days ago - Pushed: 6 months ago

chaosparrot/parrot.py

Computer interaction using audio and speechrecognition

* requirements-posix.txt
* requirements-windows.txt

Size: 2.33 MB - Last synced: 2 days ago - Pushed: 2 days ago

nhadiq/project

* END TO END SPEECH/code/STT/requirements.txt

Size: 19.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

vivinastase/voxseg

A parameterized version of Voxseg

* setup.py

Size: 35.6 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

Alohomora-team/AlohomoraAPI

Um sistema de autenticação biométrica por voz utilizando mfcc e fastDTW.

==0.6 docker/requirements.txt

Size: 4.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

bytedance/neurst

Neural end-to-end Speech Translation Toolkit

* requirements.txt

Size: 1.32 MB - Last synced: 26 days ago - Pushed: almost 2 years ago

realzza/ChinaBirds

* requirements.txt

Size: 116 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

amoljagadambe/lazarus

Pit for speaker verification and identification

==0.6 requirements.txt

Size: 38.1 KB - Last synced: about 1 year ago - Pushed: about 1 year ago

EmnaRejaibi/SQA_Lab

* requirements.txt

Size: 6.17 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

Yassinlazaar/lab6QualityAndTesting

* requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

oumayma122/SQA-lab

* requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

dhiebenzid/lab6

* requirements.txt

Size: 6.23 MB - Last synced: 4 months ago - Pushed: about 2 years ago

kaisbarboura99/SQA_lab

* requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

Ameur150999/SQA_Lab

* requirements.txt

Size: 6.16 MB - Last synced: 4 months ago - Pushed: about 2 years ago

Jihene-ch/SQA_Lab

* requirements.txt

Size: 6.16 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

thamer824/lab6

* requirements.txt

Size: 6.15 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

RanimAmor/SQA_Lab

* requirements.txt

Size: 6.15 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

lihanghang/CASR-DEMO

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

==0.6 requirements.txt

Size: 97.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago

Evelynn-n/wav2pic

一次将音乐转化为画面的试验

* requirements.txt

Size: 51.8 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

philipperemy/deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

>=0.6 requirements.txt

Size: 79.6 MB - Last synced: 10 days ago - Pushed: 29 days ago

NVIDIA/OpenSeq2Seq 📦

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

* requirements.txt

Size: 57.4 MB - Last synced: 15 days ago - Pushed: about 3 years ago

microsoft/ELL

Embedded Learning Library

* requirements.txt

Size: 31.9 MB - Last synced: 3 days ago - Pushed: over 1 year ago

hantswilliams/digitalClone

digital clone

* fastApi/requirements.txt
* fastApi/src/components/video/original_requirements.txt

Size: 26.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

philipperemy/tensorflow-ctc-speech-recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

* requirements.txt

Size: 634 KB - Last synced: 10 days ago - Pushed: about 3 years ago

deskool/nlp-class

A Natural Language Processing course taught by Professor Ghassemi

* homework/HW7/requirements.txt

Size: 48.2 MB - Last synced: 6 months ago - Pushed: about 3 years ago

Jenarthanan14/Unified-Voice-Embedding-Using-Multi-task-Learning

>=0.6 hive-mtl/requirements.txt

Size: 465 KB - Last synced: 4 months ago - Pushed: about 3 years ago

kahst/AcousticEventDetection

Source code complementing our paper for acoustic event classification using convolutional neural networks.

* requirements.txt

Size: 518 KB - Last synced: 7 months ago - Pushed: over 3 years ago

zkmkarlsruhe/language-identification

Spoken Language Identification on Common Voice and AudioSet using Deep Learning

* requirements.txt

Size: 5.44 MB - Last synced: 8 months ago - Pushed: almost 2 years ago

JeanMaximilienCadic/deepspeech2paddle-docker

* requirements.txt

Size: 418 KB - Last synced: 10 months ago - Pushed: 10 months ago

irdkwmnsb/pohui

InnoCTF 2019 finals project

* pohuy-ai/requirements.txt

Size: 2.78 MB - Last synced: 18 days ago - Pushed: over 3 years ago

linlemn/DepressionDectection

==0.6 requirements.txt

Size: 173 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

SarfarazJelil1987/ASVSpoof2017

* deep_learning/requirements.txt

Size: 1.05 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

sathyapramod/voicegender

==0.6 requirements.txt

Size: 816 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

merylldindin/Challenger

Optimization toolkit

==0.6 featurizers/requirements.txt

Size: 2.07 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

tutorialcreation/nlp_swahili_amharic

a swahili and amharic task

* requirements.txt

Size: 50 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

grant-TraDA/NLP-2022L

* projects/team_1/notebooks/eda/milestone1/requirements.txt

Size: 62 MB - Last synced: 9 months ago - Pushed: almost 2 years ago

nationalarchives/computational-archival-science-workshop

Computational Archival Science Workshop

>=0.6 Group 1 - Computer Vision/code/requirements.txt

Size: 57.6 MB - Last synced: 28 days ago - Pushed: over 2 years ago

FuxiVirtualHuman/AAAI22-one-shot-talking-face

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

* requirements.txt

Size: 8.3 MB - Last synced: 6 months ago - Pushed: over 1 year ago

skit-ai/Multimodal-Slu

This repo builds an top of self-supervised speech embeddings using S3PL tool-kit and Text based transformers from Huggingface to explore multi-modal SLU

==0.6 upstream/pase/requirements.txt

Size: 135 MB - Last synced: 24 days ago - Pushed: about 3 years ago

cranberrymuffin/anime-dub

Implementing Wav2Lip for anime faces

* requirements.txt

Size: 843 KB - Last synced: 10 months ago - Pushed: about 3 years ago

esther-amores/TFM

Master's Degree Thesis

* requirements.txt

Size: 15.8 MB - Last synced: 5 months ago - Pushed: over 1 year ago

santi-pdp/pase

Problem Agnostic Speech Encoder

==0.6 requirements.txt

Size: 10.2 MB - Last synced: 15 days ago - Pushed: 10 months ago

mohammadrezza/spoken-command-recognition

using hmm to detect words and execute commands

==0.6 requirements.txt

Size: 14.6 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

adamwawrzynski/speech-tagging-tool

Tool for phoneme indexation based on deep neural network.

* requirements.txt

Size: 14 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

USC-NSL/miniature-winner

* backup/requirements.txt

Size: 36.6 MB - Last synced: 9 months ago - Pushed: over 1 year ago

mdangschat/ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

>=0.6 requirements.txt

Size: 55.1 MB - Last synced: 6 months ago - Pushed: about 4 years ago

Overcautious/ADENet

Accepted by TMM 2022

* requirement.txt

Size: 4.71 MB - Last synced: 10 months ago - Pushed: over 1 year ago

eMaerthin/microevolution-lang-phones

This repo stores various ideas and approaches to the microevolution of individual speaker's phonemes retrieved from recordings found on youtube

* Pipfile

Size: 7.57 MB - Last synced: 9 months ago - Pushed: over 1 year ago

abbasrazaali/Multi-Dialect-Speech-Recognition

Multi-Dialect-Speech-Recognition

* requirements.txt

Size: 64.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

BaronVladziu/Phone-Aligner

Simple phonetic-text-to-audio aligner for english language based on DNN.

==0.6 requirements.txt

Size: 102 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

Antilos/but-2020-sur-proj

==0.6 src/requirements.txt

Size: 139 KB - Last synced: 4 months ago - Pushed: 10 months ago

aruroyc/TalkBangla

An exploration of different implementations for Speech Recognition using a open dataset for Bengali

==0.6 requirements.txt

Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

eubr-atmosphere/a-GPUBench

Framework to profile and collect data of applications running on GPU(s)

* apps/tf_deepspeech/deepspeech/requirements.txt

Size: 111 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

htm-community/nupic.audio

Audio (analog, digital) experiments using NuPIC HTM/CLA

* SpeechRecognition/requirements.txt

Size: 19.8 MB - Last synced: about 2 months ago - Pushed: almost 3 years ago

dc3ea9f/vico_challenge_baseline

* evaluations/lip_sync/requirements.txt

Size: 602 KB - Last synced: 10 months ago - Pushed: 10 months ago

gabriel-milan/sotaque-brasileiro 📦

Uma base de dados para estudo de regionalismos brasileiros através da voz.

==0.6 requirements.txt
==0.6 setup.py

Size: 29.3 KB - Last synced: 3 days ago - Pushed: over 2 years ago

shibing624/parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成，支持多语言，准确率高

* requirements.txt

Size: 12.2 MB - Last synced: 1 day ago - Pushed: 2 months ago

qmh1234567/speaker-identification

deep speaker and serescnn are used for SI, dataset is AIshell

==0.6 requirements.txt

Size: 16.8 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

nhadiq/Deepspeech-

SPEECH TO TEXT ENGINE

* requirements.txt

Size: 6.82 MB - Last synced: 18 days ago - Pushed: about 1 year ago

osmr/deepspeech_features

Routines for DeepSpeech features processing

* requirements.txt

Size: 17.6 KB - Last synced: about 1 year ago - Pushed: about 4 years ago

tongjinle123/speech-transformer-pytorch_lightning

ASR project with pytorch-lightning

* docker/requirements.txt

Size: 531 KB - Last synced: about 1 year ago - Pushed: about 4 years ago

Ascend/ModelZoo-TensorFlow

* TensorFlow/built-in/audio/Jasper_ID0020_for_TensorFlow/requirements.txt

Size: 228 MB - Last synced: 26 days ago - Pushed: 6 months ago

YanyanPop/advAudioCAPTCHA

* DeepSpeech/requirements.txt

Size: 59 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

sinceresu/pypraat

Pratte like python tool specially adapted for keyword clipping.

==0.6 requirements.txt

Size: 23.4 KB - Last synced: 9 months ago - Pushed: 10 months ago

Svito-zar/speech-driven-hand-gesture-generation-demo

This repository contains the gesture generation model from the paper "Moving Fast and Slow" (https://www.tandfonline.com/doi/full/10.1080/10447318.2021.1883883) trained on the English dataset

* requirements.txt

Size: 35.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

jcvasquezc/phonet

Keras-based python framework to compute phonological posterior probabilities from audio files

==0.6 phonet/train/requirements.txt
==0.6 requirements.txt
* setup.py

Size: 23 MB - Last synced: 11 days ago - Pushed: over 1 year ago

jdanbrown/birdgram

Bird song classifier in a mobile app

==0.6 model/_requirements-headstart.txt

Size: 713 MB - Last synced: 26 days ago - Pushed: 10 months ago

Morawetz/Speech-to-text-data_collection

Speech-to-text data collection with Kafka, Airflow, and Spark, building a pipeline that can be deployed to process posting and receiving text and audio files from and into a data lake, apply transformation in a distributed manner, and load it into a warehouse in a suitable format to train a speech-to-text model.

* airflow_docker/requirements.txt

Size: 38.5 MB - Last synced: about 1 year ago - Pushed: over 2 years ago

uhh-lt/subtitle2go

* requirements.txt

Size: 180 KB - Last synced: 18 days ago - Pushed: 5 months ago

Julio-Assis/RealTimeAudio

repository for the development of an arcade game controlled by voice

==0.6 requirements.txt

Size: 667 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

ydhira/ctc_viterbi_aligner

==0.6 requirements.txt

Size: 931 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

LuisMalhadas/rhasspy Fork of rhasspy/rhasspy

Offline private voice assistant for many human languages

==0.6 requirements.txt

Size: 11.9 MB - Last synced: 9 months ago - Pushed: over 1 year ago

jixinya/EAMM

Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'

* requirements.txt

Size: 9.86 MB - Last synced: 5 days ago - Pushed: about 1 year ago

texzone/The-Better-CTC-Decoder

A CTC decoder with .whl for ~easy~ easier install

* requirements.txt

Size: 60.3 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago

Jason-Oleana/speech-classification

In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.

==0.6 requirements.txt

Size: 7.03 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS