
magnet's Introduction

MagNet

Demo code for "MagNet: a Two-Pronged Defense against Adversarial Examples", by Dongyu Meng and Hao Chen, at CCS 2017.

The code demonstrates the black-box defense against Carlini's L2 attack at various confidence levels. Other techniques proposed in the paper are also included in defensive_models.py and worker.py, but they are not shown in the demo defense. Attack implementations are not provided in this repository.

Run the demo code:

  1. Make sure you have Keras, TensorFlow, NumPy, SciPy, and matplotlib installed.
  2. Clone the repository.
  3. We provide demo attack data and a classifier on Dropbox and Baidu Netdisk (password: yzt4). Please download and put the unzipped files in MagNet/. You may also use your own data for testing.
  4. Train autoencoders with python3 train_defense.py.
  5. Test the defense with python3 test_defense.py.
  6. Defense performance is plotted in graph/defense_performance.pdf.
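
For concreteness, here is a minimal sketch of how the two prongs (detector and reformer) fit together at test time. This is a conceptual illustration only, not the exact API of worker.py; the model paths, the ae and clf variables, and the per-pixel error normalization are my assumptions.

    # Conceptual sketch of the MagNet test-time pipeline (not the repo's exact API).
    import numpy as np
    from tensorflow.keras.models import load_model

    ae = load_model("defensive_models/MNIST_I.h5")     # hypothetical path/format
    clf = load_model("models/example_classifier.h5")   # hypothetical path/format

    def defend(x, threshold, p=2):
        """Return (accepted_mask, predictions) for a batch x of images in [0, 1]."""
        recon = ae.predict(x)
        # Detector: reject inputs whose mean L^p reconstruction error is too large.
        err = np.mean(np.abs(x - recon) ** p, axis=tuple(range(1, x.ndim)))
        accepted = err < threshold
        # Reformer: classify the autoencoder's reconstruction instead of the raw input.
        preds = clf.predict(recon).argmax(axis=1)
        return accepted, preds

The detector threshold is typically chosen so that only a small fraction of clean validation inputs is rejected.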

magnet's People

Contributors: Trevillie

magnet's Issues

Implementation of the CIFAR10 autoencoder

Hi, we didn't try our old model, but we can get an even better result with the following structure:
input -> 3x3x32 conv -> BN -> ReLU -> 3x3x32 conv -> ReLU -> BN -> 3x3x3 conv -> output

BTW, since we don't have our old classifier, we used a DenseNet classifier with ~92% accuracy.


Hope this helps.

Originally posted by @Trevillie in #2 (comment)

In your response to issue #2, you suggest using an even better structure: input -> 3x3x32 conv -> BN -> ReLU -> 3x3x32 conv -> ReLU -> BN -> 3x3x3 conv -> output.

But when I implemented this structure for the CIFAR10 detector or reformer, I ran into errors because it does not keep the output in the range [0, 1].
Does that structure really work in your experiments? How did you make it work?
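
For reference, here is a minimal Keras sketch of the structure quoted above, assuming 32x32x3 CIFAR10 inputs scaled to [0, 1]. The sigmoid on the final 3x3x3 conv is my own addition to keep the output in [0, 1]; the original comment does not say how (or whether) the authors constrained the output range.

    # Sketch of the suggested autoencoder: conv(32) -> BN -> ReLU -> conv(32) -> ReLU -> BN -> conv(3).
    from tensorflow.keras import layers, models

    def build_cifar_autoencoder():
        inp = layers.Input(shape=(32, 32, 3))
        x = layers.Conv2D(32, (3, 3), padding="same")(inp)
        x = layers.BatchNormalization()(x)
        x = layers.Activation("relu")(x)
        x = layers.Conv2D(32, (3, 3), padding="same")(x)
        x = layers.Activation("relu")(x)
        x = layers.BatchNormalization()(x)
        # The sigmoid is an assumption, used here only to map the output into [0, 1].
        out = layers.Conv2D(3, (3, 3), padding="same", activation="sigmoid")(x)
        return models.Model(inp, out)

    ae = build_cifar_autoencoder()
    ae.compile(optimizer="adam", loss="mse")
    # ae.fit(x_train, x_train, epochs=..., batch_size=...)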

Inconsistent L^p value used in the detector

In Section 5.2.1, the original paper says:

Detector II and detector I (see Table 3) used the $L^2$ and $L^1$ norm to measure reconstruction error, respectively.

However, in the following implementation, you use L1 for detector II and L2 for detector I.

    detector_I = AEDetector("./defensive_models/MNIST_I", p=2)
    detector_II = AEDetector("./defensive_models/MNIST_II", p=1)

Is this just a typo? Could you confirm which one is correct?

It seems the code in the repo gives the better result.
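
To make the p=1 vs. p=2 distinction concrete, here is a minimal sketch of the quantity such a detector thresholds. It mirrors the AEDetector calls quoted above only in spirit; the exact normalization in worker.py may differ.

    # Mean L^p reconstruction error per example, for a batch of images x and
    # their autoencoder reconstructions recon (both arrays of the same shape).
    import numpy as np

    def reconstruction_error(x, recon, p):
        diff = np.abs(x - recon)
        return np.mean(diff ** p, axis=tuple(range(1, x.ndim)))

    # err_I  = reconstruction_error(x, ae_I.predict(x),  p=2)   # detector I  (as in the code)
    # err_II = reconstruction_error(x, ae_II.predict(x), p=1)   # detector II (as in the code)
    # An input is flagged as adversarial when its error exceeds the detector's threshold.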

Implementation of the CIFAR autoencoder/reformer

Hi, I've been trying to reproduce your CIFAR results for a couple of weeks now, but after following the architecture in your paper, I've only gotten ~40% accuracy with the detector and reformer. Would it be possible to upload your implementation of the CIFAR MagNet architecture? Thank you!

How to test other adversarial samples

Hi, I know the demo is for the C&W adversarial attack, but now I need to test other adversarial samples. Could you tell me an easy way to do this? Thank you!
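
One straightforward option is to generate the adversarial samples yourself with another attack and feed them to the test script in place of the provided attack data. Below is a minimal FGSM sketch (not part of this repo) against a Keras classifier clf that outputs softmax probabilities; the saved file name and .npy format are my assumptions, so adapt them to whatever your test setup expects.

    # Minimal FGSM sketch: perturb each input one step along the sign of the
    # loss gradient, then clip back to the valid pixel range [0, 1].
    import numpy as np
    import tensorflow as tf

    def fgsm(clf, x, y, eps=0.1):
        x = tf.convert_to_tensor(x, dtype=tf.float32)
        y = tf.convert_to_tensor(y)
        with tf.GradientTape() as tape:
            tape.watch(x)
            probs = clf(x, training=False)
            loss = tf.keras.losses.sparse_categorical_crossentropy(y, probs)
        grad = tape.gradient(loss, x)
        x_adv = tf.clip_by_value(x + eps * tf.sign(grad), 0.0, 1.0)
        return x_adv.numpy()

    # np.save("fgsm_adv.npy", fgsm(clf, x_test, y_test))  # hypothetical file name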

Inconsistency with your results in Figures 4 & 5 of the paper

Hi! I'm trying to reproduce your defense-performance results for different confidences of Carlini's L2 attack on MNIST and CIFAR10 (Figures 4 & 5), but I ran into some issues.
For MNIST:
I generated about 10,000 adversarial samples for each confidence (0.0, 10.0, 20.0, 30.0, 40.0), about 50,000 samples in total, and got the following graph.
[figure: defense_performance_mnist]
My no_defense curve trends downward and has high accuracy at confidence 0.0, but your no_defense accuracy stays at 0% for every confidence. Since higher confidence yields a higher attack success rate, I wonder why you got such a curve.
For CIFAR10:
I generated about 10,000 adversarial samples for each confidence (0.0, 20.0, 40.0, 60.0, 80.0, 100.0), about 60,000 samples in total, and got the following graph.
[figure: defense_performance_cifar]
Besides the no_defense issue seen on MNIST, my with_detector curve is also inconsistent with yours: your curve in the paper shows an upward trend, while my test data shows that nearly 99% of the adversarial samples pass the detector. The detector does not seem to work.
I've seen your discussion in issue 1, so I also used your new autoencoder architecture and a better classifier with an accuracy of 86%. I got the following graph.
[figure: defense_performance_cifar_better]
The trends of the curves do not seem to change; only a higher starting accuracy was obtained.
To sum up, there are two issues:

  1. My no_defense curve is inconsistent with yours.
  2. The detector seems not to make any difference in my experiment.

Could you help me with this problem? Thank you very much!
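
One thing worth double-checking when the detector appears to pass everything is how its threshold was set. As an illustration only (not necessarily the exact procedure used in this repo), a common recipe is to pick the threshold as a high percentile of the reconstruction errors on clean validation data, so that only a small, fixed fraction of legitimate inputs is rejected:

    # Illustrative threshold selection: reject roughly false_positive_rate
    # of clean validation inputs, and flag anything above that error as adversarial.
    import numpy as np

    def pick_threshold(errors_clean, false_positive_rate=0.01):
        return np.quantile(errors_clean, 1.0 - false_positive_rate)

    # errors_clean = reconstruction_error(x_val, ae.predict(x_val), p=2)
    # threshold = pick_threshold(errors_clean)
    # is_adversarial = reconstruction_error(x_test, ae.predict(x_test), p=2) > threshold

If the chosen threshold ends up far above the errors that the adversarial samples produce, nearly everything will pass the detector, which would match the behaviour described above.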

My results are not as good as yours in Figure 4 (page 10)

Hi! I have tried to reproduce your defense results on the MNIST dataset, but my results are not as good as yours.
I trained the defense models and the classifier, downloaded the data you uploaded to Baidu Netdisk, and tested the defense performance. My classifier's accuracy is 99%+, and the mean squared error of the two autoencoder models is less than 0.003. Finally, I got the following graph.
[figure: defense performance on MNIST]

I think there may be a problem with the detector, but I don't know the reason!
