Gradient-based learning applied to document recognition, Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner, 1998, Proceedings of the IEEE, Vol. 86 (IEEE), DOI: 10.1109/5.726791 - Introduces the LeNet-5 architecture, which prominently features pooling layers for downsampling and achieving local translation invariance.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Provides a comprehensive theoretical explanation of pooling layers, including their role in CNNs for feature map reduction and achieving local invariance.
Keras MaxPooling2D layer, Keras Team, 2024 - Official documentation for the MaxPooling2D layer in Keras, detailing its arguments and usage.