Topic: "audio-text-modeling"
xiaomi-research/dasheng-glap
Official implementation of GLAP (General Language Audio Pretraining)
Language: Python - Size: 315 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 1 - Forks: 1

winston-lin-wei-cheng/MultiScale-Chunk-Regularization
Audio-text multimodal emotion recognition model that is robust to missing data
Language: Python - Size: 188 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
