Deep Reinforcement Learning that Matters, Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina Precup, David Meger, 2018, Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32 (Association for the Advancement of Artificial Intelligence), DOI: 10.1609/aaai.v32i1.11694 - This foundational paper identifies and analyzes key factors contributing to irreproducibility in deep RL, such as hyperparameter sensitivity, random seeds, and subtle implementation choices.
Reproducibility, Stable Baselines3 Contributors, 2023 (Stable Baselines3 Contributors) - Provides practical advice and code examples from the Stable Baselines3 library for managing randomness, environment versions, and other factors to improve experimental reproducibility.