Classification and Regression Trees, Leo Breiman, Jerome H. Friedman, Richard A. Olshen, Charles J. Stone, 1984 (Chapman and Hall/CRC)DOI: https://doi.org/10.1201/9781315139470 - This foundational text introduces Classification and Regression Trees (CART), providing a thorough explanation of their construction, properties, and use as predictive models.
Greedy Function Approximation: A Gradient Boosting Machine, Jerome H. Friedman, 2001The Annals of Statistics, Vol. 29 (Institute of Mathematical Statistics)DOI: 10.1214/aos/1013203451 - This seminal paper formalizes the gradient boosting algorithm, demonstrating how it iteratively combines weak learners, particularly decision trees, to optimize arbitrary differentiable loss functions.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, Jerome Friedman, 2009 (Springer) - A comprehensive textbook that dedicates chapters to decision trees and gradient boosting, providing theoretical background and practical insights into their mechanisms and applications.