Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A comprehensive textbook that covers the fundamental theory of autoencoders, including the motivation for dimensionality reduction (undercomplete) and the role of regularization for learning useful features (relevant for overcomplete).
Sparse Autoencoders (part of UFLDL Tutorial), Andrew Ng, 2012 (Stanford University) - An educational resource providing a clear explanation of sparse autoencoders, demonstrating how overcomplete architectures can learn meaningful, distributed features through sparsity regularization.
Extracting and Composing Robust Features with Denoising Autoencoders, Pascal Vincent, Hugo Larochelle, Yoshua Bengio, Pierre-Antoine Manzagol, 2008Proceedings of the 25th International Conference on Machine Learning (ICML) (ACM)DOI: 10.1145/1390156.1390294 - A foundational paper that introduces denoising autoencoders, a significant method for training autoencoders, especially overcomplete ones, to learn robust feature representations by reconstructing clean inputs from corrupted versions.
Reducing the Dimensionality of Data with Neural Networks, Geoffrey E. Hinton, Ruslan R. Salakhutdinov, 2006Science, Vol. 313 (American Association for the Advancement of Science)DOI: 10.1126/science.1127647 - A seminal paper that demonstrated the effectiveness of deep autoencoders for learning lower-dimensional, compressed representations of data, laying the groundwork for undercomplete autoencoders for dimensionality reduction.