Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
synchronization video-understanding audioset vas cross-modality visual-audio audio-generation visual-to-sound
-
Updated
Apr 12, 2022