Here, we use two different algorithms, FGSM (Fast Gradient Sign Method) and PGD (Projected Gradient Descent), to create images that look identical to the input to the human eye but fool an image classification model into misclassifying them as something else. These algorithms work on almost all image classification models; in this project I use the VGG16 classifier as an example. The attacks can be targeted (misclassify the input image as a specified target label) or untargeted (misclassify the input image as any label other than its original one). The overall algorithm is the same for both attacks, but the definition of the loss function changes depending on the attack.
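To make that distinction concrete, here is a minimal sketch of how the two loss definitions could look, assuming a PyTorch classifier; the function name attack_loss is purely illustrative and not part of this repository's code.

import torch.nn.functional as F

def attack_loss(model, image, true_label, target_label=None):
    # Quantity the attack maximises by perturbing the image.
    logits = model(image)
    if target_label is None:
        # Untargeted: push the prediction away from the true label
        # by increasing its cross-entropy loss.
        return F.cross_entropy(logits, true_label)
    # Targeted: pull the prediction towards the target label
    # by decreasing its cross-entropy loss (hence the minus sign).
    return -F.cross_entropy(logits, target_label)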
This method was introduced by Goodfellow et al. It generates the adversarial image in a single step, making it highly efficient, and it is used extensively to create adversarial examples for training robust classifiers.
This algorithm works by calculating the cross-entropy loss of the image classifier and taking a step along the sign of the loss gradient to modify the image so that the loss increases (in the case of untargeted attacks). In targeted attacks, the algorithm instead aims to increase the probability of the target label.
Each pixel of the image is only modified by a small value (the step size ε), which keeps the change imperceptible to the human eye.
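As a rough sketch (not the repository's exact implementation), an untargeted FGSM step in PyTorch with a pretrained VGG16 from a recent torchvision could look like this; fgsm_attack, the epsilon value, and the dummy inputs are all illustrative assumptions.

import torch
import torch.nn.functional as F
from torchvision.models import vgg16

def fgsm_attack(model, image, label, epsilon):
    # x_adv = x + epsilon * sign( d(loss)/dx ): a single gradient-sign step,
    # so each pixel moves by at most epsilon.
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    adv = image + epsilon * image.grad.sign()
    # Assumes the image is an unnormalised tensor in [0, 1].
    return adv.clamp(0, 1).detach()

model = vgg16(weights="IMAGENET1K_V1").eval()
image = torch.rand(1, 3, 224, 224)   # stand-in for a preprocessed input image
label = torch.tensor([0])            # arbitrary ImageNet class index
adv_image = fgsm_attack(model, image, label, epsilon=8 / 255)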
This algorithm was introduced by Madry et al. and is built on top of FGSM. It is an iterative algorithm that searches for a small perturbation that brings about the misclassification. This method produces better-quality images and has a higher success rate for targeted misclassification than FGSM, but its iterative nature makes it more computationally expensive.
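Below is a hedged sketch of a targeted PGD loop, again assuming PyTorch; the function pgd_targeted and its defaults are illustrative and not this repository's API. Each iteration takes a small signed gradient step towards the target class and then projects the accumulated perturbation back into an L-infinity ball of radius epsilon around the original image.

import torch
import torch.nn.functional as F

def pgd_targeted(model, image, target_label, epsilon=8 / 255,
                 step_size=0.001, steps=500):
    original = image.clone().detach()
    adv = original.clone()
    for _ in range(steps):
        adv.requires_grad_(True)
        loss = F.cross_entropy(model(adv), target_label)
        grad, = torch.autograd.grad(loss, adv)
        # Step *down* the target-class loss to make the target more likely.
        adv = adv.detach() - step_size * grad.sign()
        # Projection: keep the total perturbation within epsilon of the
        # original image, and keep pixel values valid (assumes [0, 1] range).
        adv = original + (adv - original).clamp(-epsilon, epsilon)
        adv = adv.clamp(0, 1)
    return adv.detach()

The projection step is what distinguishes PGD from simply repeating FGSM: without it, the perturbation could grow without bound over the iterations.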
The images below were generated using PGD with a target label set. Hyperparameters: 500 iterations and a learning rate (step size) of 0.001.
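With the hypothetical pgd_targeted sketch above, those settings would correspond to a call such as the following (reusing the model and image stand-ins from the FGSM example; the target class index is arbitrary).

target = torch.tensor([954])  # arbitrary ImageNet target class index
adv_image = pgd_targeted(model, image, target, step_size=0.001, steps=500)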
git clone https://github.com/Abhiram-29/MisclassifyMe.git
cd MisclassifyMe
# install dependencies
pip install -r requirements.txt