Having examined methods for attacking machine learning models and strategies to defend against them, we now turn to determining how well those defenses actually perform. Implementing a defense mechanism is not enough; we need dependable ways to measure a model's security against potential threats. This chapter focuses on the methods and practices for rigorously assessing model security.
You will learn about standard metrics used to quantify security, such as accuracy under attack and the minimum perturbation magnitude needed to cause misclassification, typically measured with Lp norms such as L0, L2, or L∞. We will examine common benchmarking tools and frameworks, such as ART and CleverHans, that support these evaluations. A significant part of evaluation involves designing strong, adaptive attacks tailored to probe the limits of a specific defense, avoiding the false sense of security that weak assessments can provide. We will also discuss how to conduct evaluations under different attacker assumptions (threat models) and how to interpret the resulting security assessments effectively. By the end of this chapter, you will be able to set up and run systematic security evaluations for machine learning models.
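To make the first of these ideas concrete before the detailed sections, here is a minimal sketch of how accuracy under attack and per-example perturbation magnitudes might be computed with NumPy. It assumes you already have a batch of clean inputs, their adversarial counterparts, and the model's predictions on the adversarial inputs; the array names are illustrative and not tied to any particular library.

```python
import numpy as np

def robust_accuracy(y_true, y_pred_adv):
    """Fraction of examples the model still classifies correctly under attack."""
    return float(np.mean(y_true == y_pred_adv))

def perturbation_norms(x_clean, x_adv, ord=np.inf):
    """Per-example Lp distance between clean and adversarial inputs."""
    deltas = (x_adv - x_clean).reshape(len(x_clean), -1)
    return np.linalg.norm(deltas, ord=ord, axis=1)

# Hypothetical arrays: x_clean and x_adv are input batches, y_true holds the
# ground-truth labels, and y_pred_adv holds the model's predictions on x_adv.
# acc  = robust_accuracy(y_true, y_pred_adv)
# linf = perturbation_norms(x_clean, x_adv, ord=np.inf)  # L-infinity distances
# l2   = perturbation_norms(x_clean, x_adv, ord=2)       # L2 distances
# print(f"Accuracy under attack: {acc:.3f}, median L-inf: {np.median(linf):.4f}")
```

The chapter's later sections build on exactly these quantities, first defining the metrics precisely and then showing how frameworks such as ART automate their computation across attack configurations.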
6.1 Metrics for Adversarial Robustness
6.2 Benchmarking Tools and Frameworks
6.3 Adaptive Attacks: Evaluating Defenses Properly
6.4 Security Evaluations under Different Threat Models
6.5 Interpreting Robustness Evaluation Results
6.6 Setting up Robustness Benchmarks: Hands-on Practical