Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This authoritative textbook provides a comprehensive treatment of autoencoders, detailing their architecture, training objectives, and ability to learn meaningful features through dimensionality reduction.
Reducing the Dimensionality of Data with Neural Networks, Geoffrey E. Hinton and Ruslan R. Salakhutdinov, 2006Science, Vol. 313 (American Association for the Advancement of Science)DOI: 10.1126/science.1127647 - A seminal paper demonstrating how deep autoencoders can effectively reduce data dimensionality and learn powerful, non-linear representations, thereby laying much of the groundwork for modern unsupervised feature learning.
Representation Learning: A Review and New Perspectives, Yoshua Bengio, Aaron Courville, and Pascal Vincent, 2013IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35 (IEEE)DOI: 10.1109/TPAMI.2013.50 - This review article surveys various representation learning techniques, including autoencoders, and discusses how they learn abstract features for various machine learning tasks.