Long Short-Term Memory, Sepp Hochreiter, Jürgen Schmidhuber, 1997, Neural Computation, Vol. 9 (MIT Press), DOI: 10.1162/neco.1997.9.8.1735 - The seminal paper introducing the Long Short-Term Memory (LSTM) architecture, detailing its fundamental design to address vanishing and exploding gradient problems in recurrent neural networks.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A foundational textbook with a detailed chapter on recurrent neural networks, including a thorough explanation of LSTM architecture and its operational flow.
Understanding LSTM Networks, Christopher Olah, 2015 - A widely acclaimed and highly visual blog post that provides an intuitive step-by-step explanation of how LSTM cells operate, making the information flow easy to grasp.