Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook covering foundational deep learning concepts, including autoencoders, their architectures, and general hyperparameter optimization techniques.
Random Search for Hyper-Parameter Optimization, James Bergstra and Yoshua Bengio, 2012Journal of Machine Learning Research, Vol. 13 - A research paper demonstrating the efficiency and advantages of random search over grid search for hyperparameter optimization.
Practical Bayesian Optimization of Machine Learning Algorithms, Jasper Snoek, Hugo Larochelle, and Ryan P. Adams, 2012Advances in Neural Information Processing Systems (NIPS 25) (Elsevier B.V.) - Presents Bayesian optimization as a data-efficient strategy for hyperparameter tuning, which is more effective than grid or random search.