Having established a foundation with NumPy for numerical computation, we now focus on Pandas, a library optimized for working with structured data in Python. NumPy arrays are powerful for numerical tasks, but data analysis often involves tabular data with labels, mixed data types, and missing values. Pandas provides high-performance, easy-to-use data structures and data analysis tools specifically for these scenarios.
This chapter introduces the core components of Pandas:
Series
and the two-dimensional DataFrame
.Series
and DataFrames
from various sources, including Python lists, dictionaries, and NumPy arrays.DataFrames
to understand their structure, contents, and data types using functions like head()
, info()
, and describe()
.By the end of this chapter, you will be able to create and examine the basic building blocks used for most data manipulation tasks in Pandas. A practical session will reinforce these concepts.
5.1 What is Pandas?
5.2 Pandas Data Structure: Series
5.3 Creating Series
5.4 Pandas Data Structure: DataFrame
5.5 Creating DataFrames
5.6 Inspecting DataFrames
5.7 Hands-on practical: Creating and Examining Series/DataFrames
© 2025 ApX Machine Learning