Unifying Count-Based Exploration and Intrinsic Motivation, Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Rémi Munos, 2016Advances in Neural Information Processing Systems 29, Vol. 29 (NeurIPS) - 探讨了基于计数的探索和内在激励之间的联系,提出了用于高维状态空间的伪计数方法。