Dropout: A Simple Way to Prevent Overfitting, Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov, 2014Journal of Machine Learning Research, Vol. 15 (Journal of Machine Learning Research)DOI: 10.5555/2620392.2620461 - 提出了 Dropout 作为一种有效的正则化技术。这篇论文有助于理解 Dropout 率,这是一个提高模型泛化能力和防止过拟合的关键超参数。
Random Search for Hyper-Parameter Optimization, James Bergstra and Yoshua Bengio, 2012Journal of Machine Learning Research, Vol. 13 - 这篇论文的引言全面概述了超参数调整面临的挑战,例如巨大的搜索空间和相互依赖性,有助于理解该领域的基础问题。