[ICASSP 2024 Oral] WAVER: Writing-Style Agnostic Text-Video Retrieval Via Distilling Vision-Language Models Through Open-Vocabulary Knowledge
Updated Jan 10, 2024 - Python
StofNet: Super-resolution Time of Flight Network (ICASSP 2024)
Official code for "Multi-Level Motion Attention with Contrastive Learning for Few-shot Action Recognition" (ICASSP 2024)
Repository for the ICASSP 2024 paper "An Experimental Comparison Of Multi-view Self-supervised Methods For Music Tagging".
Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024
2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification
The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"
Cross-lingual learning in scene text recognition (ICASSP2024)
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speech enhancement with this repository! Don't forget to ⭐ if you find it helpful.
Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!