Pattern Recognition and Machine Learning, Christopher M. Bishop, 2006 (Springer) - A comprehensive textbook covering the theoretical basis of probabilistic models, maximum likelihood estimation, and the distinction between generative and discriminative methods.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - An authoritative book with sections on probabilistic approaches to machine learning, generative models, and challenges in high-dimensional data, providing modern context.
Auto-Encoding Variational Bayes, Diederik P Kingma, Max Welling, 2013International Conference on Learning Representations (ICLR 2014)DOI: 10.48550/arXiv.1312.6114 - The original paper introducing Variational Autoencoders, a prime example of an intractable explicit density generative model and a foundational work for the course.
Generative Adversarial Nets, Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio, 2014Advances in Neural Information Processing Systems (NeurIPS 2014)DOI: 10.48550/arXiv.1406.2661 - The seminal paper presenting Generative Adversarial Networks, a representative example of implicit density generative models.