Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis, 2015Nature, Vol. 518DOI: 10.1038/nature14236 - Introduces the Deep Q-Network (DQN) algorithm, pioneering the use of experience replay for stable and efficient training of deep neural networks in reinforcement learning environments.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - A standard textbook providing comprehensive coverage of reinforcement learning, including detailed explanations of Q-learning, deep Q-networks, and the role of experience replay in stabilizing learning with function approximators.
Prioritized Experience Replay, Tom Schaul, John Quan, Ioannis Antonoglou, and David Silver, 2016ICLRDOI: 10.48550/arXiv.1511.05952 - While introducing prioritized experience replay, this paper discusses the characteristics and limitations of uniform experience replay, highlighting how sampling strategies can further improve learning efficiency.