We have previously looked at autoencoders constructed with fully-connected layers. While useful for various data types, these architectures can encounter difficulties when processing image data due to its high dimensionality and inherent spatial structure. Flattening images into vectors for standard autoencoders often results in the loss of important local relationships between pixels.
To address these challenges with image data, this chapter focuses on Convolutional Autoencoders (ConvAEs). These models integrate principles from Convolutional Neural Networks (CNNs), allowing them to effectively process images by preserving spatial information and learning hierarchical features.
In this chapter, you will learn to:
5.1 Why Fully-Connected Autoencoders Fall Short for Images
5.2 Convolutional Layers in Autoencoder Encoders
5.3 Using Pooling Layers for Spatial Down-sampling
5.4 Transposed Convolutional Layers in Decoders
5.5 Upsampling Techniques in Decoders
5.6 Constructing a Convolutional Autoencoder Model
5.7 Extracting Hierarchical Features with Conv Autoencoders
5.8 Hands-on: Convolutional Autoencoder for Image Features
© 2025 ApX Machine Learning