Rainbow: Combining Improvements in Deep Reinforcement Learning, Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Gheshlaghi Azar, David Silver, 2018. Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, Vol. 32 (AAAI Press). DOI: 10.1609/aaai.v32i1.11792 - Demonstrates strong empirical performance by combining several improvements to the deep Q-network (DQN), including C51.