Deep Reinforcement Learning with Double Q-learning, Hado van Hasselt, Arthur Guez, David Silver, 2016Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 30 (Association for the Advancement of Artificial Intelligence)DOI: 10.1609/aaai.v30i1.10295 - This paper introduced the Double DQN algorithm to address overestimation bias in deep Q-networks.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (The MIT Press) - A comprehensive textbook on reinforcement learning, including Q-learning, DQN, and its improvements.
Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller, 2015Nature, Vol. 518 (Springer Nature)DOI: 10.1038/nature14236 - The original paper that introduced the Deep Q-Network (DQN), providing the foundational context for Double DQN.