Pattern Recognition and Machine Learning, Christopher M. Bishop, 2006 (Springer) - A foundational text on probabilistic machine learning, covering latent variable models and variational inference.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A standard reference for deep learning, with a chapter on deep generative models that includes variational autoencoders (VAEs).