Asynchronous Methods for Deep Reinforcement Learning, Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu, 2016International Conference on Machine Learning (ICML)DOI: 10.48550/arXiv.1602.01783 - 提出了A3C,这是一种重要的深度Actor-Critic算法,有效实现了描述的架构,并在多种环境中展现了其效率。