Code Monkey home page Code Monkey logo

Comments (9)

sfzhang15 avatar sfzhang15 commented on July 24, 2024 1

@evercherish Hi, sorry, until now, we just use VGG16 and Res101 as our base network. In the future to expand this work, we have the plan to try more base networks, such as MobileNet, ShuffleNet, DenseNet and so on. If we finish, we will share them.

from refinedet.

evercherish avatar evercherish commented on July 24, 2024

looking forward to it
Thanks

from refinedet.

lengly avatar lengly commented on July 24, 2024

@sfzhang15 @evercherish
It takes me two days to change base network to DenseNet.
I use the DenseNet pre-trained model from https://github.com/shicai/DenseNet-Caffe
This is the only public pre-trained caffe model I found. But the main problem is the growth rate in that model is 32. Do you know what does this means? It means we need extremely huge memory... When image-size=512, we can only set batch-size=4 on 4 GPUs(each memory is 16GB)...

Actually we don't need set growth rate so big, but I didn't find other pre-trained models...

So if you want to use DenseNet, my suggestion is to set growth rate equals to 12, and train this DenseNet on ImageNet by yourself, then use your own pre-trained model...

from refinedet.

sfzhang15 avatar sfzhang15 commented on July 24, 2024

@lengly Thanks for sharing. According to your experience, the pre-trained DenseNet model (i.e., K=32) takes more GPU memory than ResNet101. For ResNet101, a 24G GPU only can be input five 512x512 images and the BN layer is barely stable. If there are less than 5 images in one GPU, the BN layer will be unstable. So the pre-trained DenseNet model (i.e., K=32) is computationally prohibitive even for a 24G GPU. As you said, the solution is to set growth rate equals to 12 and train this DenseNet on ImageNet, then use this pre-trained model. Besides, we can use other efficient base network, such as MobileNet, Inception and so on.

from refinedet.

lengly avatar lengly commented on July 24, 2024

@sfzhang15 When you train ResNet101, did you notice some iteration is really slow? When image size=320, batch=32, it takes 1 second per iteration normally, but sometimes, it takes 5x or even 10x slower (10 seconds per iter)

from refinedet.

sfzhang15 avatar sfzhang15 commented on July 24, 2024

@lengly Yes, we also have this phenomenon, but it rarely appears. Maybe when other users on the server take up CPU or IO resources, or when the server is busy doing other tasks, this phenomenon will appear.

from refinedet.

evercherish avatar evercherish commented on July 24, 2024

@lengly Could you please share your python script?
thanks!

from refinedet.

lengly avatar lengly commented on July 24, 2024

@evercherish Sorry, our company didn't allow us to do this...

from refinedet.

evercherish avatar evercherish commented on July 24, 2024

@lengly That's all right. Thanks you for your reply.

from refinedet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.