Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (The MIT Press) - Standard comprehensive textbook for reinforcement learning, covering foundational concepts like Markov Decision Processes, Bellman equations, and dynamic programming algorithms.
Dynamic Programming and Optimal Control, Vol. I, Dimitri P. Bertsekas, 2017 (Athena Scientific) - A rigorous mathematical book on dynamic programming, including Bellman equations, relevant for a deeper understanding of optimal control.
Lecture 3: Dynamic Programming, Emma Brunskill, 2024Stanford University CS234: Reinforcement Learning Lecture Notes (Stanford University) - Provides a concise academic overview of Bellman equations, value iteration, and policy iteration within the context of reinforcement learning.