Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Dirty-Label Backdoor Attack #137

Open
deprit opened this issue Apr 3, 2024 · 0 comments
Open

Support Dirty-Label Backdoor Attack #137

deprit opened this issue Apr 3, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@deprit
Copy link
Collaborator

deprit commented Apr 3, 2024

Add support in Armory Library for an undefended Dirty-label Backdoor (DLBD) Attack applied to image classification.

In a DLBD attack, training images are chosen from the source class, a trigger applied to them, and then their labels flipped to the target class. The model is then trained on this modified data. The adversary's goal is that test images from the source class will be classified as the target class when the trigger is applied at test time.

Four primary metrics are computed after the model is trained on poisoned data.

  • Accuracy on benign test data, all classes
  • Accuracy on benign test data, source class
  • Accuracy on poisoned test data, all classes
  • Attack success rate

To evaluate a DLBD attack, Armory Library must

  • Create poison datasets by inserting triggers into selected classes and modifying labels;
  • Generate primary poisoning metrics to evaluate a poisoned model;
  • Run an example script evaluating a DLBD attack using the CIFAR10 dataset and a ResNet-18 classifier.
@deprit deprit added the phoenix label Apr 7, 2024
@deprit deprit added enhancement New feature or request and removed phoenix labels Aug 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant