Building and maintaining data infrastructure relies on a specific set of tools. This chapter introduces software and platforms fundamental to the data engineering workflow. You will learn about using SQL for data manipulation within databases, applying version control with Git to manage code, and operating systems through the command-line interface (CLI). We will also provide a high-level look at common cloud platforms and the role of workflow schedulers. Practical exercises are included to help you start using these tools, beginning with basic Git commands.
6.1 Introduction to SQL for Data Manipulation
6.2 Version Control with Git for Code
6.3 Command-Line Interface (CLI) Basics
6.4 Overview of Cloud Platforms
6.5 Introduction to Workflow Schedulers
6.6 Practice: Basic Git Commands
© 2025 ApX Machine Learning