Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis, 2015Nature, Vol. 518 (Springer Nature)DOI: 10.1038/nature14236 - 引入深度Q网络(DQN)的开创性论文,展示了使用深度强化学习和函数逼近在Atari游戏上达到人类水平的性能。
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - 一本权威教材,全面介绍了深度学习,包括神经网络的架构和训练,这对于理解DQN中的函数逼近至关重要。