Code Monkey home page Code Monkey logo

Comments (2)

gmh14 avatar gmh14 commented on June 3, 2024

Hi,

I believe there are two problems here.

First, regarding the wrong checkpoint, sorry for this inconvenience. I have previously updated the correct version of the checkpoint on SVHN. You could download it from the link in README. What I want to stress here is that the mistake ONLY happens during the conversion from our private-trained models to pytorch-available models for the aim of publishing the code, more details described in #5 (comment), nothing wrong involved in the training or testing process using our private distributed framework. Therefore, the way you truncate the first 10 dimensions of the checkpoint actually gets the correct parameters.

Furthermore, regarding the results, the difference from the paper is because, when I was implementing the training code for this repo, I found the cosine-decay learning rate can help a lot to increase the final adversarial performance (where the numbers reported in the paper are using the step-decay learning rate scheme). I have tested it on CIFAR and SVHN. On CIFAR it works nice and I have updated it in the code, but for SVHN, it is hard to train. Additional tricks are required, i.e. you need to first train the model on clean data for several epochs then finetune it with the adversarial attack, and use a large batch size. And it helps the RobNet_free to work, yet unfortunately not for the RobNet_large. So the checkpoint of RobNet_free is trained using cosine-decay learning rate, and as you can see, it yields far much better results than those of the step-decay learning rate. We did not include this in the paper, since if we do, it is unfair for the comparison of other models. But for this public use, I believe the better performance will be more helpful.

(To help understand what I am saying, you can evaluate the result of RobNet_large using the same way you treat it with RobNet_free to get the correct performance and compare it with the paper. Sorry again for so much inconvenience introduced by the previous stupid mistake...)

from robnets.

A-LinCui avatar A-LinCui commented on June 3, 2024

Thanks.

from robnets.

Related Issues (19)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.