Human-level control through deep reinforcement learning, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin A. Riedmiller, 2015Nature, Vol. 518DOI: 10.1038/nature14236 - This foundational paper introduces the Deep Q-Network (DQN) architecture, including the use of experience replay and target networks, and defines the training objective with the Q-learning loss function.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - This widely used textbook covers the fundamental concepts of reinforcement learning, including Q-learning, Bellman equations, and provides a clear explanation of how these principles apply to deep Q-networks.
Spinning Up in Deep RL, Josh Achiam, 2018 - This online resource offers a practical introduction to deep reinforcement learning, providing detailed explanations and examples of the DQN algorithm, its loss function, and training methodology.