Troubleshooting Machine Learning Deployments: A Taxonomy of Failures and Fixes, Nithya Sambasivan, Heidi Lam, Sherry Qian, Khang Nguyen, Ann Yuan, Beenish Chaudhry, Leilani Battle, Greg Nelson, Anusha Muralidharan, Li Zhang, Xiang'Anthony' Chen, Michael S. Bernstein, James Landay, 2021Proceedings of the ACM on Human-Computer Interaction, Vol. 5 (ACM)DOI: 10.1145/3476059 - This paper identifies and categorizes common failure modes in deployed machine learning systems, providing a strong rationale for robust monitoring to detect and address issues proactively.