Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - The foundational textbook for reinforcement learning, comprehensively covering Markov Decision Processes, Bellman equations, and dynamic programming algorithms like Policy Iteration and Value Iteration.
Dynamic Programming and Optimal Control, Vol. I, Dimitri P. Bertsekas, 2017 (Athena Scientific) - A rigorous and comprehensive treatment of dynamic programming, serving as a fundamental reference for the theoretical underpinnings of solving Bellman equations in discrete and continuous spaces.
CS234: Reinforcement Learning (Autumn 2023-2024), Emma Brunskill, 2025 (Stanford University) - An advanced university course covering the theoretical and algorithmic foundations of reinforcement learning, with lectures and materials on Bellman equations and dynamic programming methods.