Applied Predictive Modeling, Max Kuhn and Kjell Johnson, 2013 (Springer)DOI: 10.1007/978-1-4614-6849-3 - Offers comprehensive treatment of model evaluation, selection, and performance assessment, including metric interpretation.
Machine Learning Yearning, Andrew Ng, 2018 (deeplearning.ai) - A practical guide to understanding and interpreting machine learning model performance and error analysis, emphasizing context and baselines.