Stochastic Gradient Boosting, Jerome H. Friedman, 2002Computational Statistics & Data Analysis, Vol. 38 (Elsevier)DOI: 10.1016/S0167-9473(01)00065-2 - Introduces the Stochastic Gradient Boosting algorithm, including row subsampling, to regularize and improve the generalization of Gradient Boosting Machines.
Greedy Function Approximation: A Gradient Boosting Machine, Jerome H. Friedman, 2001The Annals of Statistics, Vol. 29 (Institute of Mathematical Statistics)DOI: 10.1214/aos/1013203451 - The foundational paper that formally introduced the Gradient Boosting Machine (GBM) algorithm, providing the theoretical basis for its iterative function approximation. SGB is a direct extension of this.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, and Jerome Friedman, 2009 (Springer) - A comprehensive textbook covering Gradient Boosting, Random Forests, and related ensemble methods, offering detailed theoretical explanations of subsampling techniques. (2nd edition)
scikit-learn User Guide: Gradient Boosting, The scikit-learn developers, 2024 - Provides practical details and parameter explanations for subsample and max_features within Scikit-learn's GradientBoostingClassifier and GradientBoostingRegressor.