Topic: "speech-representation"
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python - Size: 136 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 2,433 - Forks: 506

jishengpeng/WavTokenizer
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
Language: Python - Size: 390 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1,096 - Forks: 84

ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language: Python - Size: 9.79 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 851 - Forks: 61

jishengpeng/WavChat
A Survey of Spoken Dialogue Models (60 pages)
Size: 2.24 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 93 - Forks: 3

mechanicalsea/lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Language: Python - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 57 - Forks: 6

andi611/Mockingjay-Speech-Representation
Official implementation of Mockingjay in PyTorch
Language: Python - Size: 1.56 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 54 - Forks: 12

vectominist/MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Language: Jupyter Notebook - Size: 342 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 6

Ereboas/MagiCodec
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
Language: Python - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 3

ryota-komatsu/slp2025
Materials for the 音学シンポジウム 2025 tutorial "Introduction to Multimodal Large Language Models"
Language: Jupyter Notebook - Size: 19.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 2

jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
Language: Python - Size: 314 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 3

seorim0/SE-using-SRL-Model
Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings
Language: Python - Size: 17.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 1
