Long Short-Term Memory, Sepp Hochreiter and Jürgen Schmidhuber, 1997, Neural Computation, Vol. 9 (MIT Press), DOI: 10.1162/neco.1997.9.8.1735 - The foundational paper introducing the LSTM architecture, which was designed to address the vanishing gradient problem in recurrent neural networks.
Deep Learning (Chapter 10: Sequence Modeling: Recurrent and Recursive Networks), Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A chapter from a widely cited textbook offering a theoretical explanation of recurrent neural networks, including the mechanics of LSTMs and the long-term dependency problems they mitigate.
Keras LSTM Layer, Keras Team, 2024 - Official documentation for Keras's LSTM layer, detailing its parameters and use for practical implementation within the Keras framework.