Visualizing the Loss Landscape of Neural Networks, Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein, 2018 (Advances in Neural Information Processing Systems, NeurIPS 2018), DOI: 10.48550/arXiv.1712.09913 - A foundational paper presenting techniques for visualizing high-dimensional loss surfaces and illustrating their complex characteristics.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Provides a comprehensive theoretical foundation for deep learning, including discussions of optimization challenges and loss surfaces.