Cascaded Diffusion Models for High Fidelity Image Generation, Jonathan Ho, Chitwan Saharia, William Chan, David J. Fleet, Mohammad Norouzi, Tim Salimans, 2022Journal of Machine Learning Research, Vol. 23 (Journal of Machine Learning Research) - This paper introduced AdaLN-Zero, a conditional normalization technique that significantly improved the stability and performance of diffusion models, especially in cascaded architectures.
Layer Normalization, Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton, 2016arXiv preprint arXiv:1607.06450DOI: 10.48550/arXiv.1607.06450 - The foundational paper introducing Layer Normalization, which computes statistics independently for each sample, making it robust to varying batch sizes and a precursor to adaptive normalization techniques like AdaLN.