Asynchronous Methods for Deep Reinforcement Learning, Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu, 2016ICML 2016DOI: 10.48550/arXiv.1602.01783 - Original paper introducing the A3C algorithm, detailing its asynchronous architecture and benefits for efficient deep reinforcement learning.
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - Standard textbook for reinforcement learning, providing foundational concepts for Actor-Critic methods, n-step returns, and policy gradients.
Reinforcement Learning (UCL Course), David Silver, 2015 - Lecture series offering conceptual clarity on reinforcement learning algorithms, including a detailed explanation of Actor-Critic methods.