Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - This textbook is the definitive resource for reinforcement learning. Chapters 9 and 10 provide an in-depth explanation of function approximation, semi-gradient TD methods, and their convergence properties.
Gradient Descent for Reinforcement Learning, Leemon C. Baird, 1994Advances in Neural Information Processing Systems 7, Vol. 7 (MIT Press)DOI: 10.5555/2984950.2985068 - This foundational paper highlights the 'deadly triad' (function approximation, bootstrapping, and off-policy learning) and the potential for divergence in TD learning with function approximation, explaining the challenges semi-gradient methods address.
CS234: Reinforcement Learning (Winter 2024), Emma Brunskill, 2025 (Stanford University) - Stanford's comprehensive course on Reinforcement Learning offers lecture notes and videos that explain semi-gradient TD methods and function approximation, providing an alternative pedagogical perspective.