Practice: Implementing a CNN for Image Classification
Was this section helpful?
Gradient-Based Learning Applied to Document Recognition, Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner, 1998Proceedings of the IEEE, Vol. 86 (IEEE)DOI: 10.1109/5.726791 - Introduces the foundational architecture of Convolutional Neural Networks (CNNs), including convolutional layers, pooling, and dense layers, demonstrated with handwritten digit recognition on the MNIST dataset.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook on deep learning covering neural network architectures, training methodologies, loss functions, and optimization algorithms like Adam. Chapters on CNNs are relevant.
Flux.jl Documentation, The Flux.jl Community, 2025 - Official documentation for Flux.jl, providing detailed API references and guides for constructing neural networks, handling data with MLUtils, and setting up training loops in Julia.
Adam: A Method for Stochastic Optimization, Diederik P. Kingma and Jimmy Ba, 2014International Conference for Learning Representations (ICLR) 2015DOI: 10.48550/arXiv.1412.6980 - Introduces the Adam optimization algorithm, a widely adopted adaptive learning rate method used for training deep neural networks efficiently.