Code Monkey home page Code Monkey logo

wideresnet-pytorch's Introduction

wideresnet-pytorch's People

Contributors

xternalz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

wideresnet-pytorch's Issues

Accuracy

Hi,

Did you manage to get the same accuracy as in the paper?

Cheers!

Size mismatch

Hey there,

I'm trying WideResNet on the following dataset, which consists of 256x256x3 images which should be classified into 17 classes.

However I have a size issue on the network. With a batch size of 1, I expected a 1x17 output but I get a 64x17 output. With a batch size of 8 I get 512 x 17 output.

I'm trying to understand the shapes of the tensors at each stage with with the following prints:

def forward(self, x):
        print(x)
        out = self.conv1(x)
        print(out)
        out = self.block1(out)
        print(out)
        out = self.block2(out)
        print(out)
        out = self.block3(out)
        print(out)
        out = self.relu(self.bn1(out))
        print(out)
        out = F.avg_pool2d(out, 8)
        print(out)
        out = out.view(-1, self.nChannels)
        print(out)
        out = self.fc(out)
        print(out)
        return out
$  python main.py
Variable containing:
...
[torch.FloatTensor of size 1x3x256x256]

Variable containing:
...
[torch.FloatTensor of size 1x16x256x256]

Variable containing:
...
[torch.FloatTensor of size 1x64x256x256]

Variable containing:
...
[torch.FloatTensor of size 1x128x128x128]

Variable containing:
...
[torch.FloatTensor of size 1x256x64x64]

Variable containing:
...
[torch.FloatTensor of size 1x256x64x64]

Variable containing:
...
[torch.FloatTensor of size 1x256x8x8]

Variable containing:
...
[torch.FloatTensor of size 64x256]

Variable containing:
...
[torch.FloatTensor of size 64x17]

The place to add dropout

In your code, you add dropout after relu2, your process is like dropout(relu2(bn2(conv1(relu1(bn1(x)))))).

But it this code, he adds dropout after conv1, his process is like relu2(bn2(dropout(conv1(relu1(bn1(x)))))).

Does it matter? What is the difference on performance between the two methods? I'm troubled, I can't reproduce the performance on CIFAR10(I only get accuracy at 93.2%) using the second method.

Adding LICENSE

Could you add LICENSE (e.g. MIT) to help people better benefit from your work?
Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.