After understanding how data pipelines move information, the next logical step is to consider where that data resides. Storing data effectively is fundamental to any data engineering task, as the choice of storage impacts accessibility, performance, and cost.
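This chapter examines the core principles of data storage relevant to data engineers. You will learn about different storage paradigms and when to use them: relational databases, NoSQL databases, file storage systems, and object storage, along with the common data formats used to store data.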
By the end of this chapter, you will have a foundational understanding of the primary data storage options available and the factors involved in selecting the appropriate solution for a given task. We will also practice setting up a basic database table.
4.1 Choosing the Right Data Storage
4.2 Working with Relational Databases (SQL Basics)
4.3 Introduction to NoSQL Databases
4.4 Understanding File Storage Systems
4.5 Object Storage Basics
4.6 Common Data Formats
4.7 Practice: Setting up a Simple Database Table
© 2025 ApX Machine Learning