Deep Reinforcement Learning that Matters, Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger, 2018Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32 (Association for the Advancement of Artificial Intelligence)DOI: 10.1609/aaai.v32i1.11694 - 这篇基础性论文识别并分析了导致深度强化学习结果不可复现性的关键因素,如超参数敏感性、随机种子和微小的实现差异。