An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-commands

nyumaya/nyumaya_audio_recognition

Classify audio with neural nets on embedded systems like the Raspberry Pi

Language: Python - Size: 138 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 86 - Forks: 14

dobby-seo/Wav2Keyword

Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. The authors report state-of-the-art results on the Speech Commands datasets V1 and V2.

Language: Python - Size: 11.3 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 105 - Forks: 30

isadrtdinov/kws-attention

Attention-based model for keyword spotting

Language: Python - Size: 999 KB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 6

manojsvgit/Voice_Based_Email_For_Blind

A Python-based application designed specifically for visually impaired users, enabling them to seamlessly send and receive emails using intuitive speech commands. This innovative solution enhances accessibility and independence by allowing users to manage their email communication effortlessly, utilizing voice recognition technology to ensure a us.

Language: Python - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

mryndzionek/kws_cli

Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.

Language: C - Size: 968 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

Audio-WestlakeU/audiossl

A library for easier audio self-supervised training and downstream task evaluation

Language: Python - Size: 13.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 86 - Forks: 9

YuanGongND/ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 872 - Forks: 165

ace19-dev/tensorflow-speech-recognition-challenge

Kaggle Competitions: TensorFlow Speech Recognition Challenge

Language: Python - Size: 17.6 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 25 - Forks: 10

hoang1007/FRIDAY

Female Replacement Intelligent Digital Assistant Youth

Size: 342 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

reddiedev/197z-kws

Zero-shot keyword spotting on the KWS test dataset using ImageBind

Language: Jupyter Notebook - Size: 2.72 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

htqin/BiFSMN

PyTorch implementation of BiFSMN, IJCAI 2022

Language: Python - Size: 252 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 1

usc-sail/gen-dmcca

Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations

Language: Python - Size: 664 KB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 6

philsyn/DiffWave-unconditional

PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.

Language: Python - Size: 84.4 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 18 - Forks: 2

tuanio/audio-classification

Audio Classification with AlexNet and Speech Commands dataset

Language: Python - Size: 118 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

aminul-huq/Speech_Command_Recognition

Multi-class classification of speech command data. The dataset comes from the Kaggle speech recognition challenge, and the implementation uses PyTorch.

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

Bill2015/Speech-Chinese-Model-Agent

A model-based agent for Chinese speech recognition.

Language: Python - Size: 8.57 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

SebastianThomas1/keyword_spotter

Speech recognition of keyword commands

Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

epfluegel/TalkMaths

A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX) by voice, using a ZOO interface (Zoomable Online Outliner) such as WorkFlowy or Dynalist.

Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

Related Keywords
speech-commands (18), speech-recognition (7), keyword-spotting (6), pytorch (6), speech-command-recognition (3), machine-learning (3), audio-classification (3), kws (3), speech (2), voice-commands (2), deep-learning (2), audio (2), wake-word-detection (2), pytorch-lightning (2), hotword-detection (2), voxceleb1 (1), diffwave (1), speech-embeddings (1), multiview-learning (1), deep-multiset-cca (1), binary-neural-networks (1), zero-shot (1), pytorch-audio (1), imagebind (1), representation-learning (1), assistant (1), speech-classification (1), android (1), tensorflow (1), kaggle-competition (1), workflowy (1), vocola (1), spoken-maths (1), spoken-digits (1), latex (1), dynalist (1), tensorflow2 (1), keyword-spotter (1), flask-restful (1), rule-based (1), model-based (1), chines (1), agent (1), pytorch-implementation (1), multiclass-classification (1), kaggle-dataset (1), alexnet (1), waveform-generator (1), waveform-generation (1), waveform (1), speech-synthesis (1), urbansound8k (1), user-experience (1), speech-to-text (1), python-libraries (1), python-development (1), project-for-visually-impaired (1), natural-language-processing (1), email-client (1), email-automation (1), command-line-interface (1), assistive-technology (1), accessibility (1), attention-mechanism (1), transfer-learning (1), state-of-the-art (1), fine-tuning (1), raspberry-pi (1), hotword (1), embedded-systems (1), audio-recognition (1), self-supervised-learning (1), nsynth (1), audioset (1), audio-self-supervised-learning (1), audio-representation (1), audio-pretraining (1), audio-datasets (1), word-spotting (1), wake-word (1), tinyml (1), onnx (1), machinelearning (1), lightweight (1), hotword-detector (1), edgeml (1), cli (1), c-language (1), voice-user-interface (1), voice-recognition (1)