Now that we have methods for handling categorical data, we turn to numerical features. The range and distribution of raw numerical values can directly impact the effectiveness of algorithms sensitive to feature scales, such as distance-based methods or those using gradient descent. This chapter focuses on techniques to prepare numerical data for modeling by adjusting its scale and distribution.
We will cover common scaling methods, including Standardization (Z-score scaling), which results in data with zero mean and unit variance ($Z = \frac{x - \mu}{\sigma}$), and Normalization (Min-Max scaling), which confines values to a specific interval like [0, 1]. We will also look at Robust Scaling for data containing outliers. Additionally, you will learn about transformations like Log, Box-Cox, and Yeo-Johnson, used to modify skewed distributions and make data more suitable for certain modeling assumptions. The chapter provides guidance on selecting and applying these techniques effectively using Scikit-learn.
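As a quick preview of the techniques covered in the sections below, here is a minimal sketch using Scikit-learn's `preprocessing` module. The synthetic skewed data, the random seed, and the injected outlier are illustrative assumptions for this preview, not data from the chapter.

```python
import numpy as np
from sklearn.preprocessing import (
    StandardScaler, MinMaxScaler, RobustScaler, PowerTransformer
)

# Illustrative data: a right-skewed feature with one large outlier
rng = np.random.default_rng(42)
X = np.concatenate([rng.lognormal(mean=0.0, sigma=0.75, size=99), [50.0]])
X = X.reshape(-1, 1)  # scikit-learn expects shape (n_samples, n_features)

# Standardization: zero mean, unit variance (Z = (x - mu) / sigma)
z = StandardScaler().fit_transform(X)

# Normalization: rescale values into the [0, 1] interval
mm = MinMaxScaler().fit_transform(X)

# Robust scaling: center on the median and scale by the IQR,
# so the outlier at 50.0 has far less influence on the result
rb = RobustScaler().fit_transform(X)

# Yeo-Johnson power transform: reduces skew and, unlike Box-Cox,
# also accepts zero and negative inputs
yj = PowerTransformer(method="yeo-johnson").fit_transform(X)

print("standardized mean/std:", z.mean().round(3), z.std().round(3))
print("min-max range:", mm.min().round(3), "to", mm.max().round(3))
```

Note that each transformer is fit on the data before transforming it; in practice you fit on the training set only and reuse the fitted transformer on validation and test data, a point revisited in the hands-on practical.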
4.1 The Need for Feature Scaling
4.2 Standardization (Z-score Scaling)
4.3 Normalization (Min-Max Scaling)
4.4 Robust Scaling for Outliers
4.5 Log Transformation for Skewed Data
4.6 Box-Cox Transformation
4.7 Yeo-Johnson Transformation
4.8 Quantile Transformation
4.9 Choosing the Right Scaling/Transformation Method
4.10 Hands-on Practical: Scaling and Transforming Features