An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: speech-embedding

bunyaminergen/WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 3

DigitalPhonetics/BetterFinetuning

Code accompanying our paper on finetuning self-supervised general speech representations with a combination of contrastive and non-contrastive methods.

Language: Python - Size: 75.2 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0