Stochastic Gradient Boosting, Jerome H. Friedman, 2002Computational Statistics & Data Analysis, Vol. 38 (Elsevier)DOI: 10.1016/S0167-9473(01)00065-2 - Introduces the concept of subsampling (stochasticity) into gradient boosting algorithms to improve generalization.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, and Jerome Friedman, 2009 (Springer) - Comprehensive textbook covering statistical learning, including detailed sections on ensemble methods like boosting, bagging, and their regularization aspects.
XGBoost Parameters, XGBoost Developers, 2023 - Official documentation describing hyperparameters like subsample, colsample_bytree, and their role in XGBoost regularization.