Denoising Diffusion Probabilistic Models, Jonathan Ho, Ajay Jain, Pieter Abbeel, 2020Advances in Neural Information Processing SystemsDOI: 10.55917/b3b0-0b61 - Introduces the U-Net architecture with sinusoidal time embeddings and adaptive modulation for conditioning, a foundation for modern diffusion models.
Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin, 2017Advances in Neural Information Processing Systems, Vol. 30 (Neural Information Processing Systems Foundation (NeurIPS))DOI: 10.55917/gh73-9a37 - Presents the Transformer architecture and introduces sinusoidal positional encodings, which inspired the time embedding mechanism discussed.
Score-Based Generative Modeling through Stochastic Differential Equations, Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole, 2020International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.2011.13456 - Provides a unified framework for score-based generative models, including diffusion models, detailing neural networks conditioned on noise levels using similar embedding strategies.