An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: spoken-language-processing

malifalhakim/prompt-based-tts-indo

Prompt-based Text-to-Speech system using Parler TTS, designed for generating natural-sounding speech in Indonesian. Includes dataset preparation, model training, inference pipeline, and performance evaluation.

Language: Jupyter Notebook - Size: 534 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

Language: Python - Size: 432 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 71 - Forks: 15

Sherry-XLL/Digital-Recognition-DTW_HMM_GMM

10 digits recognition system based on DTW, HMM and GMM

Language: Python - Size: 15.9 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 7

wanghao15536870732/ChatWithEveryone

🚧The Internet + project YiLuYuBan.The project is too messy, has moved to https://github.com/wanghao15536870732/ChatWithChinese

Language: Java - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 2

el841/ruby Fork of ruby/ruby

The Ruby Programming Language

Language: Ruby - Size: 275 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

navierula/mood-class

software that analyzes speech utterances

Language: Python - Size: 15.3 MB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 3

praaline/Praaline

Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora

Language: C - Size: 147 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 4

kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

Size: 121 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 218 - Forks: 26

vocaliodmiku/SIL-LLM-LL

Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"

Size: 13.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vocaliodmiku/SLI-LL

Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"

Size: 14.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ReneeYe/ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Language: Python - Size: 3.62 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 38 - Forks: 3

tianleimin/Thesis-EmotionRecognition

Example codes for my PhD work on recognizing dimensional emotions in spoken dialogue

Language: Python - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 9 - Forks: 3

ReneeYe/XSTNet

This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)

Language: Python - Size: 988 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 17 - Forks: 3

gchrupala/peppa

Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917

Language: Python - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 2

samKenpachi011/Spoken-Language-Processing

A guide to spoken language processing

Language: Jupyter Notebook - Size: 8.6 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

brijmohan/lid-convex-comb

Convex combination of phonotactics for large-scale spoken language identification

Language: Python - Size: 16.3 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2

loonghch/native-language-cnn

Speech subtask of the 2017 NLI Shared Task

Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 2

SushantKafle/speechtext-wimp-labeler

This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for understanding its meaning. The model operates on human-annotated corpus of word importance for its training and evaluation. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018

Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 2

vunb/is13 Fork of mesnilgr/is13

RNN for Spoken Language Understanding

Language: Python - Size: 22.5 KB - Last synced at: almost 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0