
Prediction Optimizer (to stabilize GAN training)

Introduction

This is a PyTorch implementation of the 'prediction method' introduced in the following paper:

  • Abhay Yadav et al., Stabilizing Adversarial Nets with Prediction Methods, ICLR 2018, Link
  • (Just for clarification, I'm not an author of the paper.)

The authors proposed a simple but effective method to stabilize GAN training. With this Prediction Optimizer, you can easily apply the method to your existing GAN code. This implementation is compatible with most PyTorch optimizers and network structures. (Please let me know if you run into any issues using it.)
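
At a high level, while one network is being updated, its opponent is evaluated at an extrapolated ('predicted') point rather than at its current parameters. Following the paper, the predicted parameters after an update are:

    theta_pred = theta_new + step * (theta_new - theta_old)

where theta_old denotes the parameters before the last optimizer update, and step defaults to 1.0.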

How-to-use

Instructions

  • Import prediction.py
    • from prediction import PredOpt
  • Initialize just like an optimizer
    • pred = PredOpt(net.parameters())
  • Run the model in a 'with' block to get results from a model with predicted params.
    • With the 'step' argument, you can control the lookahead step size (1.0 by default)
    • with pred.lookahead(step=1.0):
          output = net(input)
  • Call step() after each update of the network parameters (a minimal sketch of what this does internally follows this list)
    • optim_net.step()
      pred.step()
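
For intuition, here is a minimal sketch of what a PredOpt-like helper could look like internally. This is an illustrative assumption, not the repository's actual code (see prediction.py for the real implementation); it only shows the bookkeeping behind lookahead() and step():

    import contextlib
    import torch

    class SimplePredOpt:
        def __init__(self, params):
            self.params = list(params)
            self.prev = [p.detach().clone() for p in self.params]
            self.diff = [torch.zeros_like(p) for p in self.params]

        def step(self):
            # Call right after optimizer.step(): remember the last update direction.
            with torch.no_grad():
                for p, prev, d in zip(self.params, self.prev, self.diff):
                    d.copy_(p - prev)
                    prev.copy_(p)

        @contextlib.contextmanager
        def lookahead(self, step=1.0):
            # Temporarily move to the predicted point: theta + step * (theta - theta_prev)
            with torch.no_grad():
                for p, d in zip(self.params, self.diff):
                    p.add_(step * d)
            try:
                yield
            finally:
                # Restore the original (non-predicted) parameters on exit.
                with torch.no_grad():
                    for p, d in zip(self.params, self.diff):
                        p.sub_(step * d)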

Samples

  • You can find sample code in this repository (example_gan.py)
  • A sample snippet
  • import torch.optim as optim
    from prediction import PredOpt
    
    
    # ...
    
    optim_G = optim.Adam(netG.parameters(), lr=0.01)
    optim_D = optim.Adam(netD.parameters(), lr=0.01)
    
    pred_G = PredOpt(netG.parameters())             # Create a prediction optimizer with target parameters
    pred_D = PredOpt(netD.parameters())
    
    
    for i, data in enumerate(dataloader, 0):
        # (1) Training D with samples from predicted generator
        with pred_G.lookahead(step=1.0):            # in the 'with' block, the model works as a 'predicted' model
            fake_predicted = netG(Z)                           
        
            # Compute gradients and loss 
        
            optim_D.step()
            pred_D.step()
        
        # (2) Training G
    with pred_D.lookahead(step=1.0):            # 'Predicted D'
        fake = netG(Z)                          # Draw samples from the real model (not the predicted one)
            D_outs = netD(fake)
    
            # Compute gradients and loss
    
            optim_G.step()
            pred_G.step()                           # You should call PredOpt.step() after each update
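
The '# Compute gradients and loss' lines in the snippet are intentionally elided. As a rough illustration only (not code from this repository), the D update inside the first 'with' block could be expanded as below, assuming a standard DCGAN-style BCE loss with a sigmoid output on D; real_images, criterion, and the label tensors are hypothetical names:

    criterion = torch.nn.BCELoss()

    netD.zero_grad()
    d_real = netD(real_images)                       # real_images: a batch from the dataloader (assumed)
    loss_real = criterion(d_real, torch.ones_like(d_real))
    d_fake = netD(fake_predicted.detach())           # detach so gradients don't flow into G here
    loss_fake = criterion(d_fake, torch.zeros_like(d_fake))
    (loss_real + loss_fake).backward()

    optim_D.step()
    pred_D.step()                                    # record D's update for its next prediction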

Output samples

You can find more images in the following issues (reproduced below).

Training w/ large learning rate (0.01)

Vanilla DCGAN vs. DCGAN w/ prediction (step=1.0)
[Images: CIFAR-10 and CelebA samples after 25 epochs]

Training w/ medium learning rate (1e-4)

Vanilla DCGAN vs. DCGAN w/ prediction (step=1.0)
[Images: CIFAR-10 and CelebA samples after 25 epochs]

Training w/ small learning rate (1e-5)

Vanilla DCGAN vs. DCGAN w/ prediction (step=1.0)
[Images: CIFAR-10 and CelebA samples after 25 epochs]

External links

TODOs

  • Implement as an optimizer
  • Support pip install
  • Add some experimental results


Issues

CelebA experiments

Notes

  • This experiment is only meant to show sample outputs. Different random seeds can lead to totally different outcomes, so repeated trials would be needed to compare the two GAN methods properly.
  • For faster training, I used only 50k images from CelebA (resized to 64x64).
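
As a rough sketch of that preprocessing (the dataset path and the way the 50k subset is taken are assumptions for illustration, not taken from this repository):

    import torchvision.transforms as transforms
    from torchvision.datasets import ImageFolder
    from torch.utils.data import DataLoader, Subset

    transform = transforms.Compose([
        transforms.Resize(64),
        transforms.CenterCrop(64),
        transforms.ToTensor(),
        transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),
    ])

    dataset = ImageFolder(root='data/celeba', transform=transform)  # path is an assumption
    subset = Subset(dataset, range(50000))                          # first 50k images
    dataloader = DataLoader(subset, batch_size=64, shuffle=True)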

Large learning rate (0.01)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Medium learning rate (0.0001)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Small learning rate (1e-5)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

CIFAR-10 experiments with different LRs (PredictionOpt only on the generator)

Notes

  • This experiment is only meant to show sample outputs. Different random seeds can lead to totally different outcomes, so repeated trials would be needed to compare the two GAN methods properly.
  • For these results, the prediction method has been applied only to G (a sketch of this setup follows below).
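
Concretely, applying the prediction step only to G amounts to keeping a single PredOpt for the generator and dropping the pred_D bookkeeping. A minimal sketch under that assumption (not taken from example_gan.py):

    pred_G = PredOpt(netG.parameters())    # no PredOpt for D

    # (1) D update: train against the predicted generator
    with pred_G.lookahead(step=1.0):
        fake_predicted = netG(Z)
        # ... compute D loss and gradients ...
        optim_D.step()

    # (2) G update: train against the current (non-predicted) D
    fake = netG(Z)
    D_outs = netD(fake)
    # ... compute G loss and gradients ...
    optim_G.step()
    pred_G.step()                          # record G's update for the next prediction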

Large learning rate (0.01)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Medium learning rate (0.0001)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Small learning rate (1e-5)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

CIFAR-10 experiments with different LRs

Notes

  • This experiment is only meant to show sample outputs. Different random seeds can lead to totally different outcomes, so repeated trials would be needed to compare the two GAN methods properly.

Large learning rate (0.01)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Medium learning rate (0.0001)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Small learning rate (1e-5)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

CelebA experiments (PredictionOpt only on the generator)

Notes

  • This experiment is only meant to show sample outputs. Different random seeds can lead to totally different outcomes, so repeated trials would be needed to compare the two GAN methods properly.
  • For faster training, I used only 50k images from CelebA (resized to 64x64).
  • For these results, the prediction method has been applied only to G.

Large learning rate (0.01)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Medium learning rate (0.0001)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]

Small learning rate (1e-5)

Vanilla DCGAN

[Images: samples after 2, 10, and 25 epochs]

DCGAN w/ prediction

[Images: samples after 2, 10, and 25 epochs]
