Attention Is All You Need, Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin, 2017Advances in Neural Information Processing Systems, Vol. 30 (Neural Information Processing Systems Foundation (NeurIPS))DOI: 10.55917/gh73-9a37 - 提出了Transformer架构并引入了正弦位置编码,为本文讨论的时间嵌入机制提供了思路。