This repo is really nice, performance on pascal voc could be reproduce using 2 gpus wi

Hello <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-ur

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Hello <a class="user-mention notranslate" data-hovercard-type="user" data

Nice Repo! about deeplabv3plus-pytorch HOT 6 OPEN

vainf commented on July 3, 2024

Nice Repo!

from deeplabv3plus-pytorch.

Comments (6)

VainF commented on July 3, 2024 1

Actually, when I train on 2 GPUs and 4 GPUs machine. The performance did variate, with a 2 percent drop on 4 GPUs machine. From my point of view, as it doesn't use global BN, thus per GPU batch size did value a lot.

@MaureenZOU, Yes, batch size is an important hyper param for BN. It is recommended to use a large batch size (e.g. >8). As far as know, there is no SyncBN in pytorch. Please try third party implementations if SyncBN is required.

Hello @shipra25jain , It works with any number of GPUs.

Thanks @VainF for the reply. It seems to be working now on adding 'device_ids' in DataParallel() as my default gpu_ids are not 0 and 1 but 5 and 7. However, there seems to be a bug in polyLR scheduler. Shouldn't it be (1 - last_epoch/max_epochs)**power ? I mean instead of max_iters in formula, it should be max_epochs?

@shipra25jain, thank you for pointing out this issue. In this repo, the learning rate is scheduled at each iteration, so last_epoch actually means last_iter. I will rename it to make the code more straightforward.

from deeplabv3plus-pytorch.

shipra25jain commented on July 3, 2024

Did this repo work when you gave 2 GPU ids in the argument? You had to make any changes in the code?

from deeplabv3plus-pytorch.

VainF commented on July 3, 2024

Hello @shipra25jain , It works with any number of GPUs.

from deeplabv3plus-pytorch.

MaureenZOU commented on July 3, 2024

Actually, when I train on 2 GPUs and 4 GPUs machine. The performance did variate, with a 2 percent drop on 4 GPUs machine. From my point of view, as it doesn't use global BN, thus per GPU batch size did value a lot.

from deeplabv3plus-pytorch.

MaureenZOU commented on July 3, 2024

@VainF if my experiment did has any problem. Please point out!

from deeplabv3plus-pytorch.

shipra25jain commented on July 3, 2024

Hello @shipra25jain , It works with any number of GPUs.

Thanks @VainF for the reply. It seems to be working now on adding 'device_ids' in DataParallel() as my default gpu_ids are not 0 and 1 but 5 and 7. However, there seems to be a bug in polyLR scheduler. Shouldn't it be (1 - last_epoch/max_epochs)**power ? I mean instead of max_iters in formula, it should be max_epochs?

from deeplabv3plus-pytorch.

Recommend Projects

Nice Repo! about deeplabv3plus-pytorch HOT 6 OPEN

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent