Minimal reproduction of Diffusion Transformer architecture, untrained.
- Scalable diffusion models with transformers (William Peebles and Saining Xie, 2023) [Main paper]
I switched to this from normal U-net backbone diffusion. Why? cus I just prefer transforemrs (and DiTs ARE currently the SOTA architecture)