speech-representation | Topic | Ecosyste.ms: Repos

Topic: "speech-representation"

s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python - Size: 136 MB - Last synced at: 16 days ago - Pushed at: about 2 months ago - Stars: 2,433 - Forks: 506

jishengpeng/WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Language: Python - Size: 390 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1,096 - Forks: 84

ddlBoJack/emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language: Python - Size: 9.79 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 851 - Forks: 61

jishengpeng/WavChat

A Survey of Spoken Dialogue Models (60 pages)

Size: 2.24 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 93 - Forks: 3

mechanicalsea/lighthubert

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Language: Python - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 57 - Forks: 6

andi611/Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

Language: Python - Size: 1.56 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 54 - Forks: 12

vectominist/MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

Language: Jupyter Notebook - Size: 342 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 6

Ereboas/MagiCodec

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

Language: Python - Size: 216 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 3

ryota-komatsu/slp2025

音学シンポジウム2025チュートリアル「マルチモーダル大規模言語モデル入門」資料

Language: Jupyter Notebook - Size: 19.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 2

jefflai108/Semi-Supervsied-Spoken-Language-Understanding-PyTorch

Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining

Language: Python - Size: 314 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 3

seorim0/SE-using-SRL-Model

Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings

Language: Python - Size: 17.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 6 - Forks: 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos