β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework, Irina Higgins, Loïc Matthey, Arka Pal, Christopher Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner, 2017ICLR 2017 Deep Learning Symposium - Introduces the β-VAE, a variant designed to encourage disentangled representations in the latent space through a scaled KL divergence term.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Comprehensive textbook with a dedicated chapter on Autoencoders, including a detailed section on Variational Autoencoders, their loss function, and characteristics.