#

icassp2024

Here are 11 public repositories matching this topic...

Fsoft-AIC / WAVER

[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

knowledge-distillation open-vocabulary vision-language-model text-video-retrieval icassp2024 writing-style-agnostic

Updated Jan 10, 2024
Python

hahnec / stofnet

StofNet: Super-resolution Time of Flight Network (ICASSP 2024)

audio learning localization deep-learning acoustic super-resolution ultrasound neural trilateration time-of-flight tof multilateration icassp time-of-arrival round-trip non-destructive-testing icassp2024

Updated Feb 22, 2024
Python

YWCandGHY / BiMACL

Official code for "Multi-Level Motion Attention with Contrastive Learning for Few-shot Action Recognition" (IICASSP2024)

action-recognition few-shot icassp2024

Updated Jan 12, 2024
Python

deezer / multi-view-ssl-benchmark

Repository for the ICASSP 2024 paper "An Experimental Comparison Of Multi-view Self-supervised Methods For Music Tagging".

music-information-retrieval music-tagging self-supervised-learning audio-representation-learning icassp2024

Updated May 20, 2024
Python

SMIL-SPCRAS / DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024

signal-processing corpus speech-recognition multi-modal attention-mechanism avsr icassp spatio-temporal-features in-the-wild audio-visual icassp2024

Updated Apr 8, 2024
JavaScript

seorim0 / ResUNet-LC

2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification

deep-neural-networks deep-learning dnn pytorch ecg classification resnet multi-label-classification icassp electrocardiogram ecg-classification abnormal-detection abnormality-detection icassp2024

Updated Jan 4, 2024
Python

yousefkotp / Flare-Free-Vision-Empowering-Uformer-with-Depth-Insights

The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"

deep-learning image-processing neural-networks image-restoration depth-estimation depth-map icassp image-enhancement icassp2024 image-enhancing u-shaped-transformer flare-removal flare-free ieee-icassp

Updated Aug 27, 2024
Python

ku21fan / CLL-STR

Cross-lingual learning in scene text recognition (ICASSP2024)

multilingual ocr ocr-recognition scene-text-recognition icassp2024 cross-lingual-learning multilingual-text-recognition

Updated Apr 9, 2024
Jupyter Notebook

DmitryRyumin / Awesome-Speech-Enhancement

Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.

awesome speech-enhancement icassp2024

Updated Apr 19, 2024
Jupyter Notebook

nianlonggu / WhisperSeg

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

transformer whisper audio-segmentation voice-activity-detection icassp2024 animal-sound-detection whisperseg

Updated Sep 9, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Sep 23, 2024
Python

Improve this page

Add a description, image, and links to the icassp2024 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the icassp2024 topic, visit your repo's landing page and select "manage topics."