Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, 2015, Nature, Vol. 518, DOI: 10.1038/nature14236 - This foundational paper introduces the Deep Q-Network (DQN) algorithm, including the experience replay mechanism, which addresses correlated training data and unstable learning in deep reinforcement learning.
Prioritized Experience Replay, Tom Schaul, John Quan, Ioannis Antonoglou, David Silver, 2016, International Conference on Learning Representations (ICLR), DOI: 10.48550/arXiv.1511.05952 - This paper improves experience replay by sampling transitions with high temporal-difference error more often, which makes learning more efficient than uniform sampling.
Reinforcement Learning: An Introduction (2nd edition), Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - A comprehensive textbook covering the foundations of reinforcement learning, including a discussion of Deep Q-Networks and the role of experience replay.