Pattern Recognition and Machine Learning, Christopher M. Bishop, 2006 (Springer) - A classic and comprehensive textbook providing a foundational introduction to variational inference, including the derivation of the ELBO and the mean-field approximation.
Variational Inference: A Review for Statisticians, David M. Blei, Alp Kucukelbir, and Jon D. McAuliffe, 2017Journal of the American Statistical Association, Vol. 112 (Taylor & Francis Online)DOI: 10.1080/01621459.2017.1285773 - This review paper provides an accessible and broad overview of variational inference, framing it as an optimization problem and discussing its applications and modern developments.
Probabilistic Machine Learning: Advanced Topics, Kevin Patrick Murphy, 2023 (MIT Press) - A modern, comprehensive textbook offering an in-depth treatment of variational inference, from fundamentals to advanced algorithms and applications.