Source code for "Visually aligned sound generation via sound-producing motion parsing" (published in Neurocomputing)
synchronization
video-understanding
audioset
vas
cross-modality
visual-audio
audio-generation
visual-to-sound
Updated Apr 12, 2022