Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin, 2017. Advances in Neural Information Processing Systems, Vol. 30 (Curran Associates, Inc.). DOI: 10.5555/3295222.3295349 - The foundational paper that introduced the Transformer architecture, on which diffusion transformers are built.
FiLM: Visual Reasoning with a General Conditioning Layer, Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, Aaron Courville, 2018. Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32 (Association for the Advancement of Artificial Intelligence). DOI: 10.1609/aaai.v32i1.11671 - Introduces feature-wise linear modulation (FiLM), which provides the broader context for adaptive normalization methods such as AdaLN-Zero used in DiTs.