Exploration by Random Network Distillation, Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov, 2018International Conference on Learning Representations (ICLR)DOI: 10.48550/arXiv.1810.12894 - Presents Random Network Distillation (RND), an intrinsic motivation method that uses the prediction error of a randomly initialized target network for exploration.
Unifying Count-Based Exploration and Intrinsic Motivation, Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos, 2016Advances in Neural Information Processing Systems 29, Vol. 29 (NeurIPS) - Explores the connection between count-based exploration and intrinsic motivation, proposing pseudo-count methods for high-dimensional state spaces.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - A comprehensive presentation of reinforcement learning theory, including foundational exploration techniques.