Optimization Methods for Large-Scale Machine Learning, Léon Bottou, Frank E. Curtis, Jorge Nocedal, 2018SIAM Review, Vol. 60 (Society for Industrial and Applied Mathematics)DOI: 10.1137/16M1080173 - A comprehensive survey of optimization algorithms for large-scale machine learning, covering the theoretical properties of SGD and various variance reduction methods.