Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding, Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi, 2022arXiv preprint arXiv:2205.11487DOI: 10.48550/arXiv.2205.11487 - Formalizes Classifier-Free Guidance (CFG) and demonstrates its effectiveness in achieving high-quality, text-aligned image generation, providing the standard formulation used today.
Denoising Diffusion Probabilistic Models, Jonathan Ho, Ajay Jain, Pieter Abbeel, 2020Advances in Neural Information Processing Systems 33 (NeurIPS 2020)DOI: 10.48550/arXiv.2006.11239 - The foundational paper that introduced Denoising Diffusion Probabilistic Models (DDPMs), providing the theoretical and practical basis for the diffusion models on which CFG is applied.