A Distributional Perspective on Reinforcement Learning, Marc G. Bellemare, Will Dabney, Rémi Munos, 2017Proceedings of the 34th International Conference on Machine Learning, Vol. 70 (PMLR) - Introduces the categorical approach to distributional reinforcement learning and the C51 algorithm.
Implicit Quantile Networks for Distributional Reinforcement Learning, Will Dabney, Georg Ostrovski, David Silver, Remi Munos, 2018Proceedings of the 35th International Conference on Machine Learning, Vol. 80 (PMLR) - Extends QR-DQN by learning a continuous quantile function, allowing estimation of any quantile.
Rainbow: Combining Improvements in Deep Reinforcement Learning, Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver, 2018Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, Vol. 32 (AAAI Press)DOI: 10.1609/aaai.v32i1.11792 - Demonstrates strong empirical performance by combining several Deep Q-Network enhancements, including C51.