Self-Attention with Relative Position Representations, Peter Shaw, Jakob Uszkoreit, Ashish Vaswani, 2018Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers) (Association for Computational Linguistics)DOI: 10.18653/v1/N18-2074 - 本文提出了最早将相对位置信息通过学习到的相对位置嵌入添加到键和值中,从而引入自注意力机制的明确公式之一。