Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, 2017Advances in Neural Information Processing Systems (NeurIPS 2017)DOI: 10.48550/arXiv.1706.03762 - 引入 Transformer 模型的开创性论文,该模型是 Conformer 架构中提供全局上下文建模能力的核心组成部分。