An open API service providing repository metadata for many open source software ecosystems.

Topic: "icassp"

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python - Size: 2.2 MB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 757 - Forks: 128

sibozhang/Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Language: Python - Size: 209 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 427 - Forks: 92

DmitryRyumin/ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Language: Python - Size: 8.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 388 - Forks: 17

IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Language: Python - Size: 460 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 329 - Forks: 85

soham97/awesome-sound_event_detection

Reading list for research topics in Sound AI

Size: 145 KB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 180 - Forks: 9

Jiaxin-Ye/TIM-Net_SER

[ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Language: Python - Size: 10.3 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 143 - Forks: 24

glam-imperial/EmotionalConversionStarGAN

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

Language: Python - Size: 168 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 123 - Forks: 27

DmitryRyumin/NewEraAI-Papers

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

Language: Python - Size: 98.6 KB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 103 - Forks: 3

XuesongYang/end2end_dialog

ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager

Language: Python - Size: 123 MB - Last synced at: 15 days ago - Pushed at: almost 8 years ago - Stars: 74 - Forks: 25

fonfonx/FaceRecognition

Face Recognition in real-world images [ICASSP 2017]

Language: Python - Size: 137 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 38 - Forks: 23

30stomercury/Interaction-Aware-Attention-Network

[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs

Language: Python - Size: 499 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 32 - Forks: 9

Neclow/SERAB

SERAB: a multi-lingual benchmark for speech emotion recognition

Language: Python - Size: 93.1 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 4

monetjoe/latex_templates

LaTeX templates for papers; select your conference or journal by switching branches.

Language: TeX - Size: 1.6 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 26 - Forks: 1

orbxball/icassp2019-latex-template

ICASSP 2019 official LaTeX template

Language: TeX - Size: 102 KB - Last synced at: 19 days ago - Pushed at: almost 4 years ago - Stars: 24 - Forks: 14

doheejin/HiPAMA

This repository is the implementation of the HiPAMA architecture, introduced in the paper "Hierarchical Pronunciation Assessment with Multi-Aspect Attention" (ICASSP 2023).

Language: Python - Size: 478 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 21 - Forks: 1

eleGAN23/QVAE

Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).

Language: Python - Size: 1.29 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 4

choyingw/SCADC-DepthCompletion

ICASSP 2021: Scene Completeness-Aware Lidar Depth Completion for Driving Scenario

Language: Python - Size: 43.3 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 1

koudounasalkis/voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Language: Python - Size: 19.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 0

huangyz0918/kws-continual-learning

Continual Learning Benchmark for Spoken Keyword Spotting

Language: Python - Size: 3.83 MB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 1

kjw11/Speaker-Aware-CTC

Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.

Language: Python - Size: 3.73 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 13 - Forks: 1

meowoodie/Regularized-RBM

A regularized version of RBM for unsupervised feature selection.

Language: Python - Size: 1.59 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 13 - Forks: 2

KrishnaswamyLab/ImageFlowNet

[ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

Language: Python - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 12 - Forks: 0

yousefkotp/Flare-Free-Vision-Empowering-Uformer-with-Depth-Insights

The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"

Language: Python - Size: 1.31 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 10 - Forks: 1

ChenLiu-1996/ImageFlowNet

[ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

Language: Python - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 8 - Forks: 0

SMIL-SPCRAS/DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024

Language: JavaScript - Size: 5.82 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

kwatcharasupat/directional-sparse-filtering-tf

Python implementation of Directional Sparse Filtering with TensorFlow/Keras

Language: Python - Size: 21.5 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 1

Factral/PrivDL

Code for the paper "Privacy-Preserving Deep Learning: Leveraging Deformable Operators for Secure Task Learning"

Language: Python - Size: 20.1 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

seorim0/ResUNet-LC

2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification

Language: Python - Size: 36.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

sungjae-cho/ICASSP2020_STDemo

Show and Tell demonstration homepage

Language: HTML - Size: 6.14 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

hahnec/stofnet

StofNet: Super-resolution Time of Flight Network (ICASSP 2024)

Language: Python - Size: 19.3 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

CostasAK/icassp2023

Jupyter Notebook associated with our submission for the 2023 ICASSP, "Sensor Selection for Angle of Arrival Estimation Based on the Two-Target Cramér-Rao Bound"

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

testzer0/SpeakerVerification

My implementation of "Generalized End-to-End Loss for Speaker Verification" (ICASSP 2018)

Language: Jupyter Notebook - Size: 2.2 GB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

CVSSP/icassp-pandoc

IEEE ICASSP Template for Pandoc

Language: TeX - Size: 227 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

Related Topics
deep-learning (12), pytorch (7), signal-processing (6), icassp2024 (5), emotion-recognition (4), deep-neural-networks (3), tts (3), speech-synthesis (3), icassp2023 (3), interspeech (3), unsupervised-learning (2), image-prediction (2), image-forecasting (2), icassp2025 (2), disease-progression (2), icassp2021 (2), machine-learning (2), computer-vision (2), python (2), medical-image-analysis (2), neural-ode (2), spatial-temporal (2), generative-models (2), time-series-forecasting (2), trajectory-prediction (2), unet (2), self-supervised-learning (2), depth-estimation (2), artificial-intelligence (2), image-processing (2), neural-networks (2), speech-recognition (2), icassp-2020 (2), face-recognition (2), speech-processing (2), ismir (2), asr (2), icassp-2019 (2), speech-emotion-recognition (2), text-to-speech (2), tensorflow (2), ieee (2), keyword-spotting (2), spoken-language-understanding (1), sensor-selection (1), notebook (1), multi-target (1), jupyter-notebook (1), jupyter (1), array-processing (1), sparse-sensing (1), signal-restoration (1), bert (1), semantic-segmentation (1), credit-card-dataset (1), credit-card-transaction (1), fraud-detection (1), feature-selection (1), attention-mechanism (1), zero-shot-learning (1), audio-visual (1), avsr (1), corpus (1), in-the-wild (1), multi-modal (1), spatio-temporal-features (1), ctc (1), sactc (1), sound-event-detection (1), representation-learning (1), audio-retrieval (1), audio-processing (1), audio-generation (1), audio-captioning (1), time-series (1), acoustic-scene-classification (1), vad (1), angle-of-arrival (1), gpt (1), non-destructive-testing (1), round-trip (1), super-resolution (1), time-of-arrival (1), time-of-flight (1), tof (1), trilateration (1), ultrasound (1), csmt (1), eurasip (1), icme (1), audio-classification (1), foundation-models (1), non-verbal-vocalisation (1), open-source (1), pre-training (1), differential-equations (1), latent-space (1), medical-imaging (1), cvpr (1), emnlp (1)