Comments (8)
Hi, I think the following is related. What if we just want to alternate optimizers during training, step by step? I mean cases where we, for example, use Adam for 5 epochs and then SGD for the next 5 epochs. How could we prevent training_step, which in that case has an "optimizer_idx" argument, from executing twice?
@wheatdog @sidhanthholalkere This is on master now. Override optimizer_step to update any optimizer at arbitrary intervals.
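For example, here is a minimal sketch of alternating Adam and SGD every 5 epochs that way, assuming the optimizer_step hook signature from the docs linked further down in this thread (epoch_nb, batch_nb, optimizer, optimizer_i); check the exact arguments against your installed version:

import torch
import pytorch_lightning as pl

class AlternatingModule(pl.LightningModule):
    def configure_optimizers(self):
        # Register both optimizers up front; optimizer_step decides which
        # one actually steps during a given epoch.
        adam = torch.optim.Adam(self.parameters(), lr=0.02)
        sgd = torch.optim.SGD(self.parameters(), lr=0.02)
        return [adam, sgd]

    def optimizer_step(self, epoch_nb, batch_nb, optimizer, optimizer_i):
        # Epochs 0-4 use Adam (index 0), epochs 5-9 use SGD (index 1),
        # and the pattern repeats every 5 epochs.
        active = (epoch_nb // 5) % 2
        if optimizer_i == active:
            optimizer.step()
        optimizer.zero_grad()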
First, when defining configure_optimizers, instead of

return [torch.optim.Adam(self.parameters(), lr=0.02)]

they could try

optimizer = torch.optim.Adam(self.parameters(), lr=0.02)
optimizer.skip_batch = 1
return [optimizer]
I'm sure that whoever wants to use this skipping feature would be comfortable adding a few extra lines.
To accommodate this, whenever self.optimizers = model.configure_optimizers() is called in trainer.py, you could just add the following:

for optimizer in self.optimizers:
    try:
        optimizer.skip_batch
    except AttributeError:
        optimizer.skip_batch = 0
Basically, the first part checks whether the user manually defined the skip rate, and if not, sets it to 0 (never skip).
Later on, when calling optimizer.step(), you can replace it with

if self.batch_nb % (optimizer.skip_batch + 1) == 0:
    optimizer.step()
I believe this should work with schedulers as well.
But then again, I don't know that much about PyTorch.
Nevertheless, this project looks quite exciting and I hope I can provide some help!
On another note, why would you want this feature in the first place? If it's so optimizer A can learn "faster" than B, why not just multiply B's learning rate by 1/k, so you take full advantage of all the gradients while still having B optimize more slowly than A?
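To make the moving parts concrete, here is a self-contained toy version of the idea outside Lightning (the two-layer net and hand-written loop are stand-ins, not trainer internals):

import torch

net = torch.nn.Sequential(torch.nn.Linear(4, 8), torch.nn.Linear(8, 1))
opt_a = torch.optim.Adam(net[0].parameters(), lr=0.02)
opt_b = torch.optim.SGD(net[1].parameters(), lr=0.02)
opt_b.skip_batch = 1  # user opt-in: opt_b steps only every other batch
optimizers = [opt_a, opt_b]

# Trainer-side default for optimizers that never set skip_batch.
for optimizer in optimizers:
    try:
        optimizer.skip_batch
    except AttributeError:
        optimizer.skip_batch = 0

for batch_nb in range(4):
    loss = net(torch.randn(8, 4)).pow(2).mean()
    loss.backward()
    for optimizer in optimizers:
        if batch_nb % (optimizer.skip_batch + 1) == 0:
            optimizer.step()
            optimizer.zero_grad()

Note that gradients for the skipped optimizer accumulate until its next step, since each optimizer only zeroes its own parameter groups.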
Good suggestion, but I wonder how adding properties to the optimizer might affect loading, saving, and training continuation. It seems a bit hacky, so let's think of other alternatives as well; if this turns out to be the best way, then we can go with it.
I was thinking about maybe just allowing the configure_optimizers method to return another list with config stuff:
return [opt_a, opt_b], [sched_a], [{'skip_batch': 2}]
Something like that. But I don't love this either, haha.
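For concreteness, a hypothetical sketch of how the trainer side could unpack that return value (the function name and padding rule here are assumptions, not Lightning API):

def parse_configure_optimizers(output):
    # Expects ([optimizers], [schedulers], [config dicts]); optimizers
    # without a matching config dict fall back to skip_batch=0.
    optimizers, schedulers, configs = output
    configs = list(configs) + [{}] * (len(optimizers) - len(configs))
    for optimizer, cfg in zip(optimizers, configs):
        optimizer.skip_batch = cfg.get('skip_batch', 0)
    return optimizers, schedulers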
I fail to understand how to implement a GAN-style training scheme in pytorch-lightning. Can you give me some examples?
@wheatdog @sidhanthholalkere See #106 for the discussion and #107 for the changes to support this.
Would these changes work for you?
docs here: https://williamfalcon.github.io/pytorch-lightning/Trainer/hooks/#optimizer_step
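Building on those hooks, a GAN setup looks roughly like the sketch below; it assumes the training_step(batch, batch_nb, optimizer_idx) convention discussed in this thread, and the generator/discriminator modules and losses are placeholders:

import torch
import torch.nn.functional as F
import pytorch_lightning as pl

class GAN(pl.LightningModule):
    def __init__(self, generator, discriminator, latent_dim=100):
        super().__init__()
        self.generator = generator
        self.discriminator = discriminator
        self.latent_dim = latent_dim

    def configure_optimizers(self):
        # One optimizer per sub-network; Lightning calls training_step
        # once per optimizer, passing optimizer_idx.
        opt_g = torch.optim.Adam(self.generator.parameters(), lr=2e-4)
        opt_d = torch.optim.Adam(self.discriminator.parameters(), lr=2e-4)
        return [opt_g, opt_d]

    def training_step(self, batch, batch_nb, optimizer_idx):
        real, _ = batch
        z = torch.randn(real.size(0), self.latent_dim, device=real.device)
        ones = torch.ones(real.size(0), 1, device=real.device)
        zeros = torch.zeros(real.size(0), 1, device=real.device)
        if optimizer_idx == 0:
            # Generator step: push D(fake) toward "real".
            g_loss = F.binary_cross_entropy_with_logits(
                self.discriminator(self.generator(z)), ones)
            return {'loss': g_loss}
        # Discriminator step: real -> 1, fake (detached) -> 0.
        fake = self.generator(z).detach()
        d_loss = (F.binary_cross_entropy_with_logits(self.discriminator(real), ones)
                  + F.binary_cross_entropy_with_logits(self.discriminator(fake), zeros)) / 2
        return {'loss': d_loss}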
Is there a way to change the optimizer after n epochs? I am trying to do this by calling configure_optimizers myself; it changes the lr value, but the scheduler is not working.
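One possible workaround, sketched under the same optimizer_step-signature assumption as above: register both optimizers (each with its own scheduler) up front and gate which one steps by epoch. A PyTorch scheduler is bound to the optimizer instance it was constructed with, which is why manually re-calling configure_optimizers changes the lr but leaves the old scheduler pointing at the old optimizer.

import torch
import pytorch_lightning as pl

class SwitchAfterN(pl.LightningModule):
    def __init__(self, switch_epoch=5):
        super().__init__()
        self.switch_epoch = switch_epoch

    def configure_optimizers(self):
        adam = torch.optim.Adam(self.parameters(), lr=1e-3)
        sgd = torch.optim.SGD(self.parameters(), lr=1e-2)
        # Each scheduler is tied to its own optimizer instance.
        scheds = [torch.optim.lr_scheduler.StepLR(adam, step_size=1, gamma=0.9),
                  torch.optim.lr_scheduler.StepLR(sgd, step_size=1, gamma=0.9)]
        return [adam, sgd], scheds

    def optimizer_step(self, epoch_nb, batch_nb, optimizer, optimizer_i):
        # Adam for the first switch_epoch epochs, SGD afterwards.
        active = 0 if epoch_nb < self.switch_epoch else 1
        if optimizer_i == active:
            optimizer.step()
        optimizer.zero_grad()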