Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook covering the theoretical and practical aspects of deep learning, including autoencoders, neural network architectures, activation functions, and training considerations.
Reducing the Dimensionality of Data with Neural Networks, Geoffrey E. Hinton, Ruslan R. Salakhutdinov, 2006Science, Vol. 313 (American Association for the Advancement of Science)DOI: 10.1126/science.1127647 - A seminal paper demonstrating the effectiveness of deep autoencoders for dimensionality reduction and feature learning through pre-training, influencing subsequent autoencoder research.
Deep Sparse Rectifier Networks, Xavier Glorot, Antoine Bordes, and Yoshua Bengio, 2011Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS) - This paper introduced the Rectified Linear Unit (ReLU) activation function and highlighted its advantages in training deep neural networks, addressing issues like vanishing gradients.