Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This textbook offers a comprehensive treatment of deep learning concepts, including thorough explanations of various loss functions like MSE and cross-entropy, and their application in training autoencoders.
Reducing the Dimensionality of Data with Neural Networks, Geoffrey E. Hinton, Ruslan R. Salakhutdinov, 2006Science, Vol. 313 (American Association for the Advancement of Science)DOI: 10.1126/science.1127647 - A seminal paper that introduced deep autoencoders, showcasing their capability to learn lower-dimensional data representations effectively by minimizing reconstruction errors.
Pattern Recognition and Machine Learning, Christopher M. Bishop, 2006 (Springer)DOI: 10.1007/978-0-387-45528-0 - This book provides a strong statistical and probabilistic foundation for machine learning, with detailed discussions on the principles behind common loss functions such as squared error and cross-entropy.