Real-world data often requires cleaning and reshaping before it can be effectively analyzed. Having learned how to create, load, and select data using Pandas Series and DataFrames, we now turn to the essential task of data manipulation. Raw datasets frequently contain missing entries, irrelevant information, or inconvenient column names that need addressing.
This chapter introduces fundamental techniques for tidying your data within Pandas DataFrames. You will learn how to:
Mastering these operations provides a solid foundation for preparing datasets for subsequent analysis or modeling tasks.
8.1 Detecting Missing Data
8.2 Handling Missing Data: Dropping
8.3 Handling Missing Data: Filling
8.4 Dropping Columns and Rows
8.5 Adding New Columns
8.6 Modifying Existing Columns
8.7 Renaming Columns
8.8 Sorting Data
8.9 Hands-on practical: Cleaning and Modifying DataFrames
© 2025 ApX Machine Learning