Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

Package Usage: pypi: python_speech_features

Python Speech Feature extraction
6 versions
Latest release: over 6 years ago
1 dependent package
157,550 downloads last month

View more package details: https://packages.ecosyste.ms/registries/pypi.org/packages/python_speech_features

View more repository details: https://repos.ecosyste.ms/hosts/GitHub/repositories/jameslyons%2Fpython_speech_features

Dependent Repos 244

pvsnp9/audio_classification_using_deep_learning
This project automatically classifies musical instruments using CNN and RNN
  • ==0.6 requirements.txt

Size: 47.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

bababoss/anticovidrobo
  • * tflite_speech_recognition/requirements.txt

Size: 24.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

dumengnan/unicorn
  • ==0.6 06source_code/service-center/language-service/requirements.txt

Size: 94.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

mesolitica/malaya-speech
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
  • * docs/requirements.txt

Size: 552 MB - Last synced: 10 days ago - Pushed: 10 days ago

zhangchenxu528/FACIAL
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
  • * requirements.txt

Size: 9.23 MB - Last synced: 7 months ago - Pushed: almost 2 years ago

yeyupiaoling/PaddlePaddle-DeepSpeech
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。
  • ==0.6 requirements.txt

Size: 14.8 MB - Last synced: 9 days ago - Pushed: 9 days ago

joonson/syncnet_python
Out of time: automated lip sync in the wild
  • * requirements.txt

Size: 92.8 KB - Last synced: 6 months ago - Pushed: 9 months ago

RoBorregos/robocup-home
Legacy Roborregos @Home division.
  • * catkin_home/src/action_selectors/scripts/DeepSpeech/requirements.txt

Size: 380 MB - Last synced: 5 days ago - Pushed: about 1 month ago

rhasspy/rhasspy
Offline private voice assistant for many human languages
  • ==0.6 requirements.txt

Size: 11.8 MB - Last synced: 15 days ago - Pushed: 10 months ago

ReneeYe/ConST
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
  • * requirements.txt

Size: 3.62 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

rtaori/Black-Box-Audio
Targeted Adversarial Examples for Black Box Audio Systems
  • * requirements.txt

Size: 455 KB - Last synced: about 1 year ago - Pushed: over 3 years ago

sephiroce/srf
Supplementary files for the sequential routing framework
  • * requirements.txt

Size: 38.1 KB - Last synced: 3 months ago - Pushed: almost 2 years ago

Jason-Oleana/speech-emotion-classification
MFCC features + SVM for speech emotion classification
  • ==0.6 requirements.txt

Size: 428 KB - Last synced: about 1 year ago - Pushed: over 3 years ago

rhasspy/rhasspy-wake-raven
Wake word detection engine based on Snips Personal Wakeword Detector
  • ==0.6 requirements.txt

Size: 6.29 MB - Last synced: 25 days ago - Pushed: 6 months ago

chaosparrot/parrot.py
Computer interaction using audio and speechrecognition
  • * requirements-posix.txt
  • * requirements-windows.txt

Size: 2.33 MB - Last synced: 2 days ago - Pushed: 2 days ago

nhadiq/project
  • * END TO END SPEECH/code/STT/requirements.txt

Size: 19.5 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

vivinastase/voxseg
A parameterized version of Voxseg
  • * setup.py

Size: 35.6 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

Alohomora-team/AlohomoraAPI
Um sistema de autenticação biométrica por voz utilizando mfcc e fastDTW.
  • ==0.6 docker/requirements.txt

Size: 4.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

bytedance/neurst
Neural end-to-end Speech Translation Toolkit
  • * requirements.txt

Size: 1.32 MB - Last synced: 26 days ago - Pushed: almost 2 years ago

realzza/ChinaBirds
  • * requirements.txt

Size: 116 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

amoljagadambe/lazarus
Pit for speaker verification and identification
  • ==0.6 requirements.txt

Size: 38.1 KB - Last synced: about 1 year ago - Pushed: about 1 year ago

EmnaRejaibi/SQA_Lab
  • * requirements.txt

Size: 6.17 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

Yassinlazaar/lab6QualityAndTesting
  • * requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

oumayma122/SQA-lab
  • * requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

dhiebenzid/lab6
  • * requirements.txt

Size: 6.23 MB - Last synced: 4 months ago - Pushed: about 2 years ago

kaisbarboura99/SQA_lab
  • * requirements.txt

Size: 6.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

Ameur150999/SQA_Lab
  • * requirements.txt

Size: 6.16 MB - Last synced: 4 months ago - Pushed: about 2 years ago

Jihene-ch/SQA_Lab
  • * requirements.txt

Size: 6.16 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

thamer824/lab6
  • * requirements.txt

Size: 6.15 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

RanimAmor/SQA_Lab
  • * requirements.txt

Size: 6.15 MB - Last synced: about 1 year ago - Pushed: about 2 years ago

lihanghang/CASR-DEMO
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
  • ==0.6 requirements.txt

Size: 97.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago

Evelynn-n/wav2pic
一次将音乐转化为画面的试验
  • * requirements.txt

Size: 51.8 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
  • >=0.6 requirements.txt

Size: 79.6 MB - Last synced: 10 days ago - Pushed: 29 days ago

NVIDIA/OpenSeq2Seq 📦
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
  • * requirements.txt

Size: 57.4 MB - Last synced: 15 days ago - Pushed: about 3 years ago

microsoft/ELL
Embedded Learning Library
  • * requirements.txt

Size: 31.9 MB - Last synced: 3 days ago - Pushed: over 1 year ago

hantswilliams/digitalClone
digital clone
  • * fastApi/requirements.txt
  • * fastApi/src/components/video/original_requirements.txt

Size: 26.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

philipperemy/tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
  • * requirements.txt

Size: 634 KB - Last synced: 10 days ago - Pushed: about 3 years ago

deskool/nlp-class
A Natural Language Processing course taught by Professor Ghassemi
  • * homework/HW7/requirements.txt

Size: 48.2 MB - Last synced: 6 months ago - Pushed: about 3 years ago

Jenarthanan14/Unified-Voice-Embedding-Using-Multi-task-Learning
  • >=0.6 hive-mtl/requirements.txt

Size: 465 KB - Last synced: 4 months ago - Pushed: about 3 years ago

kahst/AcousticEventDetection
Source code complementing our paper for acoustic event classification using convolutional neural networks.
  • * requirements.txt

Size: 518 KB - Last synced: 7 months ago - Pushed: over 3 years ago

zkmkarlsruhe/language-identification
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
  • * requirements.txt

Size: 5.44 MB - Last synced: 8 months ago - Pushed: almost 2 years ago

JeanMaximilienCadic/deepspeech2paddle-docker
  • * requirements.txt

Size: 418 KB - Last synced: 10 months ago - Pushed: 10 months ago

irdkwmnsb/pohui
InnoCTF 2019 finals project
  • * pohuy-ai/requirements.txt

Size: 2.78 MB - Last synced: 18 days ago - Pushed: over 3 years ago

linlemn/DepressionDectection
  • ==0.6 requirements.txt

Size: 173 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

SarfarazJelil1987/ASVSpoof2017
  • * deep_learning/requirements.txt

Size: 1.05 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

sathyapramod/voicegender
  • ==0.6 requirements.txt

Size: 816 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

merylldindin/Challenger
Optimization toolkit
  • ==0.6 featurizers/requirements.txt

Size: 2.07 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

tutorialcreation/nlp_swahili_amharic
a swahili and amharic task
  • * requirements.txt

Size: 50 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

grant-TraDA/NLP-2022L
  • * projects/team_1/notebooks/eda/milestone1/requirements.txt

Size: 62 MB - Last synced: 9 months ago - Pushed: almost 2 years ago

nationalarchives/computational-archival-science-workshop
Computational Archival Science Workshop
  • >=0.6 Group 1 - Computer Vision/code/requirements.txt

Size: 57.6 MB - Last synced: 28 days ago - Pushed: over 2 years ago

FuxiVirtualHuman/AAAI22-one-shot-talking-face
Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)
  • * requirements.txt

Size: 8.3 MB - Last synced: 6 months ago - Pushed: over 1 year ago

skit-ai/Multimodal-Slu
This repo builds an top of self-supervised speech embeddings using S3PL tool-kit and Text based transformers from Huggingface to explore multi-modal SLU
  • ==0.6 upstream/pase/requirements.txt

Size: 135 MB - Last synced: 24 days ago - Pushed: about 3 years ago

cranberrymuffin/anime-dub
Implementing Wav2Lip for anime faces
  • * requirements.txt

Size: 843 KB - Last synced: 10 months ago - Pushed: about 3 years ago

esther-amores/TFM
Master's Degree Thesis
  • * requirements.txt

Size: 15.8 MB - Last synced: 5 months ago - Pushed: over 1 year ago

santi-pdp/pase
Problem Agnostic Speech Encoder
  • ==0.6 requirements.txt

Size: 10.2 MB - Last synced: 15 days ago - Pushed: 10 months ago

mohammadrezza/spoken-command-recognition
using hmm to detect words and execute commands
  • ==0.6 requirements.txt

Size: 14.6 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

adamwawrzynski/speech-tagging-tool
Tool for phoneme indexation based on deep neural network.
  • * requirements.txt

Size: 14 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

USC-NSL/miniature-winner
  • * backup/requirements.txt

Size: 36.6 MB - Last synced: 9 months ago - Pushed: over 1 year ago

mdangschat/ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
  • >=0.6 requirements.txt

Size: 55.1 MB - Last synced: 6 months ago - Pushed: about 4 years ago

Overcautious/ADENet
Accepted by TMM 2022
  • * requirement.txt

Size: 4.71 MB - Last synced: 10 months ago - Pushed: over 1 year ago

eMaerthin/microevolution-lang-phones
This repo stores various ideas and approaches to the microevolution of individual speaker's phonemes retrieved from recordings found on youtube
  • * Pipfile

Size: 7.57 MB - Last synced: 9 months ago - Pushed: over 1 year ago

abbasrazaali/Multi-Dialect-Speech-Recognition
Multi-Dialect-Speech-Recognition
  • * requirements.txt

Size: 64.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

BaronVladziu/Phone-Aligner
Simple phonetic-text-to-audio aligner for english language based on DNN.
  • ==0.6 requirements.txt

Size: 102 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

Antilos/but-2020-sur-proj
  • ==0.6 src/requirements.txt

Size: 139 KB - Last synced: 4 months ago - Pushed: 10 months ago

aruroyc/TalkBangla
An exploration of different implementations for Speech Recognition using a open dataset for Bengali
  • ==0.6 requirements.txt

Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

eubr-atmosphere/a-GPUBench
Framework to profile and collect data of applications running on GPU(s)
  • * apps/tf_deepspeech/deepspeech/requirements.txt

Size: 111 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

htm-community/nupic.audio
Audio (analog, digital) experiments using NuPIC HTM/CLA
  • * SpeechRecognition/requirements.txt

Size: 19.8 MB - Last synced: about 2 months ago - Pushed: almost 3 years ago

dc3ea9f/vico_challenge_baseline
  • * evaluations/lip_sync/requirements.txt

Size: 602 KB - Last synced: 10 months ago - Pushed: 10 months ago

gabriel-milan/sotaque-brasileiro 📦
Uma base de dados para estudo de regionalismos brasileiros através da voz.
  • ==0.6 requirements.txt
  • ==0.6 setup.py

Size: 29.3 KB - Last synced: 3 days ago - Pushed: over 2 years ago

shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
  • * requirements.txt

Size: 12.2 MB - Last synced: 1 day ago - Pushed: 2 months ago

qmh1234567/speaker-identification
deep speaker and serescnn are used for SI, dataset is AIshell
  • ==0.6 requirements.txt

Size: 16.8 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

nhadiq/Deepspeech-
SPEECH TO TEXT ENGINE
  • * requirements.txt

Size: 6.82 MB - Last synced: 18 days ago - Pushed: about 1 year ago

osmr/deepspeech_features
Routines for DeepSpeech features processing
  • * requirements.txt

Size: 17.6 KB - Last synced: about 1 year ago - Pushed: about 4 years ago

tongjinle123/speech-transformer-pytorch_lightning
ASR project with pytorch-lightning
  • * docker/requirements.txt

Size: 531 KB - Last synced: about 1 year ago - Pushed: about 4 years ago

Ascend/ModelZoo-TensorFlow
  • * TensorFlow/built-in/audio/Jasper_ID0020_for_TensorFlow/requirements.txt

Size: 228 MB - Last synced: 26 days ago - Pushed: 6 months ago

YanyanPop/advAudioCAPTCHA
  • * DeepSpeech/requirements.txt

Size: 59 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

sinceresu/pypraat
Pratte like python tool specially adapted for keyword clipping.
  • ==0.6 requirements.txt

Size: 23.4 KB - Last synced: 9 months ago - Pushed: 10 months ago

Svito-zar/speech-driven-hand-gesture-generation-demo
This repository contains the gesture generation model from the paper "Moving Fast and Slow" (https://www.tandfonline.com/doi/full/10.1080/10447318.2021.1883883) trained on the English dataset
  • * requirements.txt

Size: 35.7 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

jcvasquezc/phonet
Keras-based python framework to compute phonological posterior probabilities from audio files
  • ==0.6 phonet/train/requirements.txt
  • ==0.6 requirements.txt
  • * setup.py

Size: 23 MB - Last synced: 11 days ago - Pushed: over 1 year ago

jdanbrown/birdgram
Bird song classifier in a mobile app
  • ==0.6 model/_requirements-headstart.txt

Size: 713 MB - Last synced: 26 days ago - Pushed: 10 months ago

Morawetz/Speech-to-text-data_collection
Speech-to-text data collection with Kafka, Airflow, and Spark, building a pipeline that can be deployed to process posting and receiving text and audio files from and into a data lake, apply transformation in a distributed manner, and load it into a warehouse in a suitable format to train a speech-to-text model.
  • * airflow_docker/requirements.txt

Size: 38.5 MB - Last synced: about 1 year ago - Pushed: over 2 years ago

uhh-lt/subtitle2go
  • * requirements.txt

Size: 180 KB - Last synced: 18 days ago - Pushed: 5 months ago

Julio-Assis/RealTimeAudio
repository for the development of an arcade game controlled by voice
  • ==0.6 requirements.txt

Size: 667 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

ydhira/ctc_viterbi_aligner
  • ==0.6 requirements.txt

Size: 931 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

LuisMalhadas/rhasspy Fork of rhasspy/rhasspy
Offline private voice assistant for many human languages
  • ==0.6 requirements.txt

Size: 11.9 MB - Last synced: 9 months ago - Pushed: over 1 year ago

jixinya/EAMM
Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model'
  • * requirements.txt

Size: 9.86 MB - Last synced: 5 days ago - Pushed: about 1 year ago

texzone/The-Better-CTC-Decoder
A CTC decoder with .whl for ~easy~ easier install
  • * requirements.txt

Size: 60.3 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago

Jason-Oleana/speech-classification
In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.
  • ==0.6 requirements.txt

Size: 7.03 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

keonlee9420/Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
  • ==0.6 requirements.txt

Size: 3.45 MB - Last synced: 6 months ago - Pushed: almost 2 years ago

wangsuzhen/Audio2Head
code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021
  • * requirements.txt

Size: 1.02 MB - Last synced: 18 days ago - Pushed: 3 months ago

colinator/timit_utils
Python/numpy/pandas convenience wrapper for the TIMIT database.
  • * setup.py

Size: 1.23 MB - Last synced: 3 days ago - Pushed: over 5 years ago

huxian123/deep-speaker
  • ==0.6 requirements.txt

Size: 134 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

a-n-rose/noise-manipulation-python
Where I will explore messing around with noise and speech in Python
  • >=0.6 autoencoder_speech_denoising/requirements.txt

Size: 5.34 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

NexIS_/s3prl
This is a mirror of the following repository: https://github.com/s3prl/s3prl
  • ==0.6 s3prl/upstream/pase/requirements.txt

Last synced: about 1 year ago

dataconvergence/ML
  • * News-Headline-sarcasm-Prediction-master/News-Headline-sarcasm-Prediction-master/requirements.txt

Size: 45.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago