A 64x64 pre-trained diffusion model is all you need for 1-step high-resolution SOTA generation
NeurIPS24
Unified framework enables diverse samplers and 1-step generation SOTAs
ICLR24
Applications:
[SoundGen]
Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data
DAFx24
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
ICASSP23
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
NeurIPS23
### Contact