Prioritized Experience Replay, Tom Schaul, John Quan, Ioannis Antonoglou, David Silver, 2016. International Conference on Learning Representations (ICLR). DOI: 10.48550/arXiv.1511.05952 - Presents Prioritized Experience Replay, an experience replay mechanism that samples transitions based on their temporal difference error.
Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, 2015. Nature, Vol. 518. DOI: 10.1038/nature14236 - The foundational paper that introduced Deep Q-Networks (DQN) for playing Atari games directly from raw pixel inputs.