Backdoor Attacks & Defense
Explore how adversaries can embed hidden behaviors in ML models, learn to detect trojan models with Neural Cleanse, spectral signatures, and activation clustering, and master removal techniques like fine-pruning and knowledge distillation.
Your Learning Path
Follow these lessons in order, or jump to any topic that interests you.
1. Introduction
What are backdoor attacks? Threat landscape, attack surface in ML pipelines, and why backdoors are uniquely dangerous.
2. Attack Methods
BadNets, clean-label backdoors, hidden trigger attacks, and data poisoning strategies used by adversaries (a minimal poisoning sketch appears after this list).
3. Trojan Models
How trojan models work, supply chain attacks on pre-trained models, and trojan insertion during fine-tuning.
4. Detection
Neural Cleanse, spectral signatures, activation clustering, STRIP, and meta-neural analysis for backdoor detection (a spectral signatures sketch also appears after this list).
5. Removal
Fine-pruning, knowledge distillation, unlearning techniques, and certified backdoor removal methods (a fine-pruning sketch rounds out the examples after this list).
6. Best Practices
Defense-in-depth strategies, supply chain security, model auditing workflows, and organizational policies.
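To make the attack surface concrete before you start, here is a minimal sketch of BadNets-style data poisoning from lesson 2. It stamps a small white patch onto a random fraction of training images and relabels them to an attacker-chosen target class; a model trained on the result learns both the clean task and the trigger. The function name, patch shape, and poison rate are illustrative assumptions, not a fixed recipe.

```python
import numpy as np

def poison_badnets(images, labels, target_label=0, rate=0.05,
                   patch_size=3, seed=0):
    """BadNets-style poisoning sketch: stamp a white patch on a random
    subset of images and relabel them to the attacker's target class.

    images: float array (N, H, W, C) scaled to [0, 1]
    labels: int array (N,)
    """
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    poisoned = rng.choice(len(images), size=int(rate * len(images)),
                          replace=False)
    # The trigger: a small white square in the bottom-right corner.
    images[poisoned, -patch_size:, -patch_size:, :] = 1.0
    labels[poisoned] = target_label
    return images, labels, poisoned
```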
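Lesson 4 covers several detection methods; the lightest-weight of them, spectral signatures (Tran et al., 2018), fits in a few lines and is sketched below. It scores each sample of a class by its squared projection onto the top singular vector of the centered activation matrix, on the observation that poisoned samples tend to dominate that direction. The function name and the quantile cutoff in the usage note are illustrative assumptions.

```python
import numpy as np

def spectral_signature_scores(activations):
    """Outlier scores for one class's penultimate-layer activations
    (spectral signatures). Poisoned samples tend to score highest.

    activations: (N, D) array, one row per sample of a single class.
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    # Top right-singular vector = direction of maximum variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return (centered @ vt[0]) ** 2

# Illustrative use: drop the highest-scoring samples and retrain.
# scores = spectral_signature_scores(acts)
# keep = scores < np.quantile(scores, 0.90)
```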
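Finally, a sketch of the pruning half of fine-pruning (Liu et al., 2018) from lesson 5, written here in PyTorch. Backdoor behavior often hides in neurons that clean inputs rarely exercise, so the channels of a late convolutional layer with the lowest mean activation on clean data are zeroed out, after which a short fine-tune on clean data recovers accuracy. The function name, layer choice, and prune fraction are assumptions for illustration.

```python
import torch

@torch.no_grad()
def prune_dormant_channels(conv, clean_activations, prune_frac=0.2):
    """Pruning step of fine-pruning: zero the conv channels that are
    least active on clean data.

    conv: torch.nn.Conv2d whose (post-ReLU) outputs were recorded
    clean_activations: (N, C, H, W) outputs of `conv` on clean inputs
    """
    mean_act = clean_activations.mean(dim=(0, 2, 3))  # (C,)
    dormant = torch.argsort(mean_act)[:int(prune_frac * conv.out_channels)]
    conv.weight[dormant] = 0.0
    if conv.bias is not None:
        conv.bias[dormant] = 0.0
    # Step 2 (not shown): briefly fine-tune on clean data, which both
    # restores accuracy and further suppresses the backdoor.
    return dormant
```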
What You'll Learn
By the end of this course, you'll be able to:
Identify Backdoor Threats
Recognize how and where backdoors can be inserted into ML models, datasets, and training pipelines.
Detect Trojan Models
Apply state-of-the-art detection methods including Neural Cleanse, spectral signatures, and activation analysis.
Remove Backdoors
Use fine-pruning, distillation, and unlearning techniques to neutralize backdoors in compromised models.
Secure ML Supply Chains
Implement policies and technical controls to prevent backdoor injection throughout the ML lifecycle.