Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio, 2014, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Association for Computational Linguistics), DOI: 10.3115/v1/D14-1179 - This paper introduced the Gated Recurrent Unit (GRU) and details the architecture and function of its gates, including the reset gate.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Provides a detailed explanation of recurrent neural networks, including GRUs and their gating mechanisms.
Sequence Models (Course 5 of Deep Learning Specialization), Andrew Ng, Kian Katanforoosh, Younes Bensouda Mourri, 2017 (DeepLearning.AI) - Offers an intuitive and practical explanation of GRUs, clarifying the role of the reset gate in sequence modeling.