An open API service providing repository metadata for many open source software ecosystems.

Topic: "speech-embeddings"

usc-sail/gen-dmcca

Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations

Language: Python - Size: 664 KB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 6

jvel07/wav2vec2_patho

Fine-tuning wav2vec2 to for Pathological Speech Processing

Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

jvel07/dnn_embeddings_pytorch

DNN embeddings extraction from audio and speech recordings using PyTorch.

Language: Python - Size: 596 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

NN-Project-1/dis-Vector-Embedding

The Dis-Vector project enhances voice conversion and synthesis through disentangled embeddings, allowing for high-quality, zero-shot voice cloning across multiple languages. This model leverages separate encoders for content, pitch, rhythm, and timbre, enabling precise control over synthesized voice characteristics.

Language: Python - Size: 11.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

peter-yh-wu/cross-lingual

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Size: 2.31 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

epistoteles/predicting-speaker-quality

This repository belongs to my Bachelor's thesis on predicting voice likability from pre-trained speech embeddings.

Language: Python - Size: 104 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0