An Introduction to Statistical Learning: with Applications in Python, Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, Jonathan Taylor, 2023 (Springer) - Provides an accessible introduction to statistical learning, covering classification methods and their evaluation metrics comprehensively.
Model evaluation: quantifying the quality of predictions, scikit-learn developers, 2024 (scikit-learn project) - Official documentation defining and illustrating classification metrics with practical examples relevant to Python implementations.
Machine Learning Yearning, Andrew NG, 2017 - Offers practical advice on selecting and understanding machine learning evaluation metrics to drive model improvement.