GitHub topics: spoken-language-processing

Repositories

ryota-komatsu/speaker_disentangled_hubert

Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"

Language: Python - Size: 1.36 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 38 - Forks: 9

kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

Language: Python - Size: 432 KB - Last synced at: 21 days ago - Pushed at: almost 2 years ago - Stars: 72 - Forks: 16

malifalhakim/prompt-based-tts-indo

Prompt-based Text-to-Speech system using Parler TTS, designed for generating natural-sounding speech in Indonesian. Includes dataset preparation, model training, inference pipeline, and performance evaluation.

Language: Jupyter Notebook - Size: 534 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Sherry-XLL/Digital-Recognition-DTW_HMM_GMM

10 digits recognition system based on DTW, HMM and GMM

Language: Python - Size: 15.9 MB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 26 - Forks: 7

wanghao15536870732/ChatWithEveryone

🚧 The Internet + project YiLuYuBan. The project is too messy and has moved to https://github.com/wanghao15536870732/ChatWithChinese

Language: Java - Size: 58.2 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 2

el841/ruby (fork of ruby/ruby)

The Ruby Programming Language

Language: Ruby - Size: 275 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 2

navierula/mood-class

software that analyzes speech utterances

Language: Python - Size: 15.3 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 3

praaline/Praaline

Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora

Language: C - Size: 147 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 26 - Forks: 4

kahne/SpeechTransProgress

Tracking the progress in end-to-end speech translation

Size: 121 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 218 - Forks: 26

vocaliodmiku/SIL-LLM-LL

Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"

Size: 13.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vocaliodmiku/SLI-LL

Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"

Size: 14.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ReneeYe/ConST

Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Language: Python - Size: 3.62 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 38 - Forks: 3

tianleimin/Thesis-EmotionRecognition

Example code for my PhD work on recognizing dimensional emotions in spoken dialogue

Language: Python - Size: 13.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 3

ReneeYe/XSTNet

This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)

Language: Python - Size: 988 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 3

gchrupala/peppa

Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917

Language: Python - Size: 13.4 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 2

samKenpachi011/Spoken-Language-Processing

A guide to spoken language processing

Language: Jupyter Notebook - Size: 8.6 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

brijmohan/lid-convex-comb

Convex combination of phonotactics for large-scale spoken language identification

Language: Python - Size: 16.3 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2

loonghuey/native-language-cnn

Speech subtask of the 2017 NLI Shared Task

Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: 2 days ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 2

SushantKafle/speechtext-wimp-labeler

This project demonstrates the use of generic bi-directional LSTM models for predicting the importance of words in a spoken dialogue for understanding its meaning. The model is trained and evaluated on a human-annotated corpus of word importance. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018

Language: Python - Size: 50.8 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 2

vunb/is13 (fork of mesnilgr/is13)

RNN for Spoken Language Understanding

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

Related Keywords

spoken-language-processing (20), speech-processing (8), machine-translation (3), natural-language-processing (3), speech-translation (3), translation (2), python (2), corpus (2), pytorch (2), computer-assisted-language-learning (2), text-to-speech (2), dataset (2), language-learning (2), speech-recognition (2), large-language-models (2), neural-machine-translation (2), non-verbal-vocalisation (1), interspeech2021 (1), emotion-recognition (1), tensorflow2 (1), vunb-create-a-chatbot (1), disfluency (1), cross-copora (1), transformer (1), speec (1), naacl2022 (1), spoken-language-understanding (1), understandability (1), importance (1), evaluation (1), toefl (1), stanford (1), nli-shared-task (1), native-language-identification (1), deep-learning (1), cs224s (1), convolutional-neural-network (1), convnet (1), cnn (1), phonological-features (1), language-identification (1), vision-and-language (1), grounding (1), child-language (1), self-supervised-learning (1), speech (1), speech-language-model (1), spoken-language-recognition (1), sound-processing (1), transformers (1), dtw (1), gmm (1), hmm (1), chat (1), video-chat (1), homepage (1), spoke (1), spoken-language (1), emotion-analysis (1), emotional-intelligence (1), machine-learning (1), pyaudioanalysis (1), annotations (1), corpus-builder (1), corpus-linguistics (1), corpus-tools (1), linguistics (1), speech-analysis (1), visualisation (1), artificial-intelligence (1), natural-language-generation (1), spoken-language-translation (1), tts-frontend (1), contrastive-learning (1)
