Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (The MIT Press) - Comprehensive textbook covering reinforcement learning fundamentals, including detailed explanations of Q-Learning and SARSA.
Learning from Delayed Rewards, Christopher J.C.H. Watkins, 1989 (University of Cambridge) - Original Ph.D. thesis introducing the Q-learning algorithm.
On-line Q-learning using connectionist systems, Gavin A. Rummery and Mahesan Niranjan, 1994, Technical Report CUED/F-INFENG/TR 166 (Cambridge University Engineering Department) - Technical report introducing the SARSA algorithm.