Reducing the Dimensionality of Data with Neural Networks, Geoffrey E. Hinton and Ruslan R. Salakhutdinov, 2006Science, Vol. 313 (American Association for the Advancement of Science)DOI: 10.1126/science.1127647 - A foundational paper that introduced deep autoencoders and demonstrated the effectiveness of greedy layer-wise pre-training for dimensionality reduction, a historically significant approach to training deep networks.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook covering the theoretical and practical aspects of deep learning, including detailed explanations of autoencoders, stacked autoencoders, and their training methodologies.
Unsupervised Feature Learning and Deep Learning (UFLDL) Tutorial, Andrew Ng, Jiquan Ngiam, Chuan Yu Foo, Yifan Mai, Caroline Suen, Adam Coates, Andrew Maas, Awni Hannun, Brody Huval, Tao Wang, Sameep Tandon, 2013 (Stanford University) - Provides clear, practical explanations and examples of autoencoders, including stacked autoencoders and their training, serving as an excellent educational resource.