CS677 final project: A study in audio-visual scene classification
-
Updated
Dec 22, 2021 - Python
CS677 final project: A study in audio-visual scene classification
This code is part of the paper: "A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation" published at ACM ICMI 2022.
A simple, efficient, convenient and free video download tool for www.bilibili.com.
Open Pinspot for Ballrooms (OPinBall) is an open source DMX over Art-Net lighting controller designed to make it easier to pinspot centerpieces for banquet functions in ballrooms.
A video feature extractor using the epic fusion model
Respository for BFI National Archive open source preservation workflow scripts
Attention-based Temporal Binding Network
Segment-level autoencoders for multimodal representation
Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
Accepted by TMM 2022
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model" (AVLIT)
Efficient synchronization from sparse cues
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Add a description, image, and links to the audio-visual topic page so that developers can more easily learn about it.
To associate your repository with the audio-visual topic, visit your repo's landing page and select "manage topics."