On Grouping for Maximum Homogeneity, Walter D. Fisher, 1958Journal of the American Statistical Association, Vol. 53 (Taylor & Francis)DOI: 10.1080/01621459.1958.10501479 - Presents a foundational algorithm for optimally partitioning a set of observations into groups, which is a conceptual basis for LightGBM's categorical splitting logic.
LightGBM: A Highly Efficient Gradient Boosting Decision Tree, Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, Tie-Yan Liu, 2017Advances in Neural Information Processing Systems, Vol. 30 (NeurIPS) - The original paper introducing the LightGBM algorithm, detailing its optimized features including the categorical splitting approach.
Parameters - LightGBM documentation, LightGBM Contributors, 2024 - Provides practical guidance on configuring LightGBM to handle categorical features, including parameters for fine-tuning its behavior.