Listen, Attend and Spell, William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals, 2015. arXiv preprint arXiv:1508.01211. DOI: 10.48550/arXiv.1508.01211 - Introduces the end-to-end Listen, Attend and Spell (LAS) model for speech recognition, detailing its encoder-decoder architecture with attention.
Sequence to Sequence Learning with Neural Networks, Ilya Sutskever, Oriol Vinyals, Quoc V. Le, 2014. Advances in Neural Information Processing Systems 27 (NIPS 2014), Neural Information Processing Systems Foundation, Inc. DOI: 10.48550/arXiv.1409.3215 - Presents the foundational sequence-to-sequence learning framework, which serves as the basis for the LAS model's approach to ASR.