Defenses

Supported Defenses

The following table lists the supported defenses in BackdoorMBTI:

Defense

Modality

Input

Stage

Output

Paper

STRIP

Audio, Image, and text

backdoor model, clean dataset

post-training

clean dataset

STRIP: A Defence Against Trojan Attacks on Deep Neural Networks

AC

Audio, Image, and text

backdoor model, clean dataset, poison dataset

post-training

clean model, clean dataset

Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering

FT

Audio, Image, and text

backdoor model, clean dataset

in-training

clean model

Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

FP

Audio, Image, and text

backdoor model, clean dataset

post-training

clean model

Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

ABL

Audio, Image, and text

backdoor model, poison dataset

in-training

clean model

Anti-Backdoor Learning: Training Clean Models on Poisoned Data

CLP

Audio, Image, and text

backdoor model

post-training

clean model

Data-free Backdoor Removal based on Channel Lipschitzness

NC

Image

backdoor model, clean dataset

post-training

clean model, trigger pattern

Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks

MNTD

Image

backdoor model

post-training

detection result

Detecting AI Trojans Using Meta Neural Analysis

FreeEagle

Image

backdoor model

post-training

detection result

FREEEAGLE: Detecting Complex Neural Trojans in Data-Free Cases