Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (The MIT Press) - A comprehensive textbook covering foundational reinforcement learning concepts, including tabular methods, function approximation (linear and non-linear), policy gradient methods, and challenges like the deadly triad.
Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg and Demis Hassabis, 2015Nature, Vol. 518 (Nature Publishing Group)DOI: 10.1038/nature14236 - Introduces Deep Q-Networks (DQN), a seminal work that successfully combines deep neural networks with reinforcement learning, using techniques like experience replay and target networks to stabilize learning with function approximation.
Reinforcement Learning Lecture 6: Value Function Approximation, David Silver, 2015 (UCL) - Lecture notes from a highly respected reinforcement learning course, offering an accessible overview of value function approximation methods and their integration with RL algorithms.