Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller, 2015Nature, Vol. 518DOI: 10.1038/nature14236 - Introduces the Deep Q-Network (DQN), combining Q-learning with deep neural networks for successful control tasks, establishing a key technique in deep reinforcement learning.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - A standard textbook that explains reinforcement learning principles, including Q-learning, Bellman equations, and function approximation methods.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational book providing background on deep neural networks, network architectures, and training using gradient-based optimization.