Data rarely remains static or originates exactly where it is needed. A core task in data engineering involves constructing pathways to move, process, and refine data for analysis or application use. These pathways are known as data pipelines.
This chapter introduces the fundamental structure and operation of data pipelines. We will examine two common architectural patterns: Extract Transform Load (ETL) and Extract Load Transform (ELT). You will learn about the distinct phases within these patterns including:
Finally, you will apply these ideas by sketching a basic data pipeline based on a given scenario.
3.1 What is a Data Pipeline?
3.2 ETL Process Explained
3.3 ELT Process Explained
3.4 Data Extraction Techniques
3.5 Basic Data Transformation Operations
3.6 Loading Data into Storage
3.7 Simple Pipeline Orchestration Concepts
3.8 Practice: Sketching a Basic Pipeline
© 2025 ApX Machine Learning