System Optimizations: Cache Awareness and Parallelism
XGBoost: A Scalable Tree Boosting System, Tianqi Chen and Carlos Guestrin, 2016. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16) (Association for Computing Machinery). DOI: 10.1145/2939672.2939785 - The original paper introducing XGBoost, detailing its system design for scalability, including parallelism, cache awareness, and the Column Block structure.
Scalable Tree Boosting. XGBoost Documentation, XGBoost Contributors, 2024 (XGBoost) - Official documentation describing XGBoost's system optimizations like parallel split finding, cache-aware data access, and out-of-core computation.
Computer Architecture: A Quantitative Approach, John L. Hennessy and David A. Patterson, 2017 (Morgan Kaufmann) - A foundational textbook providing detailed explanations of computer hardware, including CPU caches, memory hierarchies, and their impact on program performance.
Introduction to Parallel Computing, Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar, 2003 (Addison-Wesley) - A comprehensive textbook covering fundamental concepts, algorithms, and architectures for parallel computing, relevant to understanding XGBoost's parallel processing.