On Calibration of Modern Neural Networks, Chuan Guo, Geoff Pleiss, Yu Sun, Kilian Q. Weinberger, 2017Proceedings of the 34th International Conference on Machine Learning (ICML), Vol. 70 (PMLR (Proceedings of Machine Learning Research))DOI: 10.5555/3305890.3306010 - Introduces Expected Calibration Error (ECE) and temperature scaling for calibrating modern neural networks, a fundamental work in deep learning calibration.
Predicting Good Probabilities with Confidence Calibration, Alexandru Niculescu-Mizil, Rich Caruana, 2005Proceedings of the 22nd International Conference on Machine LearningDOI: 10.1145/1102351.1102430 - Compares various methods for confidence calibration, including isotonic regression and binning, providing a broad overview of calibration techniques.
Measuring and Improving the Reliability of Large Language Models, Samuel Kadavath, Tom Phambu, Liang Chen, Yuntao T. Lee, Keegan Scherer, Tom Zhao, et al., 2022arXiv preprint arXiv:2207.05221 (arXiv)DOI: 10.48550/arXiv.2207.05221 - Examines calibration in large language models (LLMs) and discusses methods for improving their reliability, directly addressing LLM-specific challenges.