GitHub topics: speech-commands
nyumaya/nyumaya_audio_recognition
Classify audio with neural nets on embedded systems like the Raspberry Pi
Language: Python - Size: 138 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 86 - Forks: 14

dobby-seo/Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2.
Language: Python - Size: 11.3 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 105 - Forks: 30

isadrtdinov/kws-attention
Attention-based model for keyword spotting
Language: Python - Size: 999 KB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 6

manojsvgit/Voice_Based_Email_For_Blind
A Python-based application designed specifically for visually impaired users, enabling them to seamlessly send and receive emails using intuitive speech commands. This innovative solution enhances accessibility and independence by allowing users to manage their email communication effortlessly, utilizing voice recognition technology to ensure a us.
Language: Python - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

mryndzionek/kws_cli
Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.
Language: C - Size: 968 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

Audio-WestlakeU/audiossl
A library built for easier audio self-supervised training and downstream task evaluation
Language: Python - Size: 13.1 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 86 - Forks: 9

YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 872 - Forks: 165

ace19-dev/tensorflow-speech-recognition-challenge
Kaggle Competitions: TensorFlow Speech Recognition Challenge
Language: Python - Size: 17.6 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 25 - Forks: 10

hoang1007/FRIDAY
Female Replacement Intelligent Digital Assistant Youth
Size: 342 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

reddiedev/197z-kws
Zero-shot keyword spotting on a KWS test dataset using ImageBind
Language: Jupyter Notebook - Size: 2.72 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

htqin/BiFSMN
PyTorch implementation of BiFSMN, IJCAI 2022
Language: Python - Size: 252 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 1

usc-sail/gen-dmcca
Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations
Language: Python - Size: 664 KB - Last synced at: 11 months ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 6

philsyn/DiffWave-unconditional
PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.
Language: Python - Size: 84.4 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 18 - Forks: 2

tuanio/audio-classification
Audio Classification with AlexNet and Speech Commands dataset
Language: Python - Size: 118 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

aminul-huq/Speech_Command_Recognition
Multi-class classification of speech command data. The dataset comes from the Kaggle speech recognition challenge, and the implementation uses PyTorch.
Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

Bill2015/Speech-Chinese-Model-Agent
A model-based agent for Chinese speech recognition.
Language: Python - Size: 8.57 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

SebastianThomas1/keyword_spotter
Speech recognition of keyword commands
Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

epfluegel/TalkMaths
A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX) by voice, using a ZOO interface (Zoomable Online Outliner) such as WorkFlowy or Dynalist.
Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0
