GitHub topics: spoken-language-processing
malifalhakim/prompt-based-tts-indo
Prompt-based Text-to-Speech system using Parler TTS, designed for generating natural-sounding speech in Indonesian. Includes dataset preparation, model training, inference pipeline, and performance evaluation.
Language: Jupyter Notebook - Size: 534 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
Language: Python - Size: 432 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 71 - Forks: 15

Sherry-XLL/Digital-Recognition-DTW_HMM_GMM
10 digits recognition system based on DTW, HMM and GMM
Language: Python - Size: 15.9 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 7

wanghao15536870732/ChatWithEveryone
🚧The Internet + project YiLuYuBan.The project is too messy, has moved to https://github.com/wanghao15536870732/ChatWithChinese
Language: Java - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 2

el841/ruby Fork of ruby/ruby
The Ruby Programming Language
Language: Ruby - Size: 275 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

navierula/mood-class
software that analyzes speech utterances
Language: Python - Size: 15.3 MB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 3

praaline/Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
Language: C - Size: 147 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 4

kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
Size: 121 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 218 - Forks: 26

vocaliodmiku/SIL-LLM-LL
Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"
Size: 13.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vocaliodmiku/SLI-LL
Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"
Size: 14.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ReneeYe/ConST
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Language: Python - Size: 3.62 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 38 - Forks: 3

tianleimin/Thesis-EmotionRecognition
Example codes for my PhD work on recognizing dimensional emotions in spoken dialogue
Language: Python - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 9 - Forks: 3

ReneeYe/XSTNet
This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)
Language: Python - Size: 988 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 3

gchrupala/peppa
Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917
Language: Python - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 2

samKenpachi011/Spoken-Language-Processing
A guide to spoken language processing
Language: Jupyter Notebook - Size: 8.6 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

brijmohan/lid-convex-comb
Convex combination of phonotactics for large-scale spoken language identification
Language: Python - Size: 16.3 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2

loonghch/native-language-cnn
Speech subtask of the 2017 NLI Shared Task
Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

SushantKafle/speechtext-wimp-labeler
This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for understanding its meaning. The model operates on human-annotated corpus of word importance for its training and evaluation. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018
Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 2

vunb/is13 Fork of mesnilgr/is13
RNN for Spoken Language Understanding
Language: Python - Size: 22.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0
