Better performance with the tf.data API, TensorFlow Authors, 2024 - Guide to building efficient data input pipelines, including shuffling and batching strategies for model training.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Explains the theoretical basis for using mini-batches and the importance of data randomization in gradient-based optimization algorithms like SGD.