
diffusion_models's Introduction

Denoising diffusion probabilistic models

These tutorials explore the new class of generative models based on diffusion probabilistic models [1]. This class of models is inspired by considerations from thermodynamics [2], but also bears strong resemblance to denoising score matching [3], Langevin dynamics, and autoregressive decoding. We will also discuss the more recent development of denoising diffusion implicit models [4], which bypass the need for a Markov chain in order to accelerate sampling. Stemming from this work, we will also discuss the WaveGrad model [5], which is based on the same core principles but applies this class of models to audio data.

In order to fully understand the inner workings of diffusion models, we will review all of the related topics through tutorial notebooks. These notebooks are available in PyTorch or in JAX (in the jax_tutorials/ folder), thanks to the great contribution of Cristian Garcia.

We split the explanation across four detailed notebooks.

  1. Score matching and Langevin dynamics
  2. Diffusion probabilistic models and denoising
  3. Applications to waveforms with WaveGrad
  4. Implicit models to accelerate inference
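
Before diving into the notebooks, here is a minimal sketch of the forward (noising) process that all of these models share, using the closed-form expression from [1]. The schedule values and variable names (betas, alphas_prod, q_sample) are illustrative assumptions, not the notebooks' exact code:

import torch

# Linear noise schedule beta_1..beta_T (the range 1e-4 to 0.02 follows [1]).
n_steps = 100
betas = torch.linspace(1e-4, 0.02, n_steps)
alphas = 1.0 - betas
alphas_prod = torch.cumprod(alphas, dim=0)  # alpha-bar_t = prod of alpha_s for s <= t

def q_sample(x_0, t, noise=None):
    # Sample x_t ~ q(x_t | x_0) = N(sqrt(alpha-bar_t) * x_0, (1 - alpha-bar_t) * I).
    if noise is None:
        noise = torch.randn_like(x_0)
    return alphas_prod[t].sqrt() * x_0 + (1.0 - alphas_prod[t]).sqrt() * noise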

[1] Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. arXiv preprint arXiv:2006.11239.

[2] Sohl-Dickstein, J., Weiss, E. A., Maheswaranathan, N., & Ganguli, S. (2015). Deep unsupervised learning using nonequilibrium thermodynamics. arXiv preprint arXiv:1503.03585.

[3] Vincent, P. (2011). A connection between score matching and denoising autoencoders. Neural computation, 23(7), 1661-1674.

[4] Song, J., Meng, C., & Ermon, S. (2020). Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502.

[5] Chen, N., Zhang, Y., Zen, H., Weiss, R. J., Norouzi, M., & Chan, W. (2020). WaveGrad: Estimating gradients for waveform generation. arXiv preprint arXiv:2009.00713.

diffusion_models's People

Contributors

acids-ircam · cgarciae · esling

diffusion_models's Issues

Why shift alphas_prod_p?

Why do you shift alphas_prod_p in diffusion_02_model.ipynb?

alphas_prod_p = torch.cat([torch.tensor([1]).float(), alphas_prod[:-1]], 0)
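
A plausible reading, not an authoritative answer: alphas_prod[t] stores the cumulative product alpha-bar_t, while the reverse (denoising) step at time t also needs alpha-bar_{t-1}, e.g. in the posterior q(x_{t-1} | x_t, x_0) of [1]. Prepending 1 shifts the sequence by one position, with the convention alpha-bar_0 = 1 (no noise at step 0). A minimal sketch under that assumption:

import torch

betas = torch.linspace(1e-4, 0.02, 100)
alphas_prod = torch.cumprod(1.0 - betas, dim=0)  # alpha-bar_t
# Entry t of the shifted sequence holds alpha-bar_{t-1}, with alpha-bar_0 = 1.
alphas_prod_p = torch.cat([torch.tensor([1.0]), alphas_prod[:-1]], 0)

# Example use: the posterior variance of q(x_{t-1} | x_t, x_0) from [1].
posterior_variance = betas * (1.0 - alphas_prod_p) / (1.0 - alphas_prod)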

n_steps and the final figure

Hello, thank you for your tutorial. It helped me a lot in understanding the model. I have a small question.
Since we set n_steps to 100, x_seq contains the initial random noise plus 100 subsequent x values, one for each denoising step. When we plot, we show the x distribution after every 10 steps. Why are the labels in the figure 100 to 1000 rather than 10 to 100? Did I miss something?
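
For reference, a minimal sketch of how the every-10-steps plot could be labeled with the true step index; it assumes the 2-D toy data and the x_seq list from the notebook (101 states for n_steps = 100), which are not redefined here:

import matplotlib.pyplot as plt

# x_seq[0] is the initial noise; x_seq[i] is the state after i denoising steps.
fig, axes = plt.subplots(1, 10, figsize=(28, 3))
for k, ax in enumerate(axes):
    step = (k + 1) * 10                    # 10, 20, ..., 100
    x = x_seq[step].detach()
    ax.scatter(x[:, 0], x[:, 1], s=5)
    ax.set_title(f"step {step}")
plt.show()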

Incorrect objective in JAX denoising score matching

import jax

def denoising_score_matching(scorenet, samples, key, sigma=0.01):
    # Perturb the clean samples with Gaussian noise of scale sigma.
    noise = jax.random.normal(key, samples.shape)
    perturbed_samples = samples + noise * sigma
    # Regression target for the score network (the line this issue disputes).
    target = -noise / sigma
    scores = scorenet(perturbed_samples)
    # Mean squared error between predicted and target scores.
    loss = 1 / 2. * ((scores - target) ** 2).sum(axis=-1).mean(axis=0)
    return loss

denoising_score_matching(model, data[:10], jax.random.PRNGKey(0))

Here target = -noise/sigma should be replaced with target = -noise
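
A hedged note rather than a ruling on this issue: both targets correspond to standard conventions, depending on what the network is taken to estimate. If scorenet estimates the score of the perturbed distribution q_sigma directly, the conditional target is -(perturbed - samples) / sigma^2, which equals -noise / sigma as in the code above; if the loss is instead weighted by sigma^2 (equivalently, the network estimates sigma times the score, i.e. the noise direction), the target reduces to -noise. A sketch of both conventions, in PyTorch for brevity:

import torch

def dsm_targets(samples, sigma=0.01):
    # The two common denoising score matching targets, side by side.
    noise = torch.randn_like(samples)
    perturbed = samples + sigma * noise
    # Convention A: the network estimates the score of q_sigma directly.
    target_score = -(perturbed - samples) / sigma**2   # equals -noise / sigma
    # Convention B: sigma^2-weighted loss; the network estimates sigma * score.
    target_scaled = -noise
    return perturbed, target_score, target_scaled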

EMA model usage

In the notebooks I see updates of the EMA model in def update, which changes the shadow model, but the def ema method is never called. Should it be called once every N epochs, or is it only meant to be used at the end of the whole training process? As far as I can see, def update does not affect the actual model, so the smoothing never happens.
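
For reference, a minimal PyTorch sketch of the usual EMA pattern; the class and method names are illustrative assumptions, not necessarily the notebook's exact code. update is called after every optimizer step to accumulate the shadow weights, and the smoothed weights are only copied into a model when you want to evaluate or sample, typically at the end of training:

import copy
import torch

class EMA:
    def __init__(self, model, mu=0.999):
        self.mu = mu
        # Shadow copy of the parameters; updated every step, never trained directly.
        self.shadow = {n: p.detach().clone() for n, p in model.named_parameters()}

    def update(self, model):
        # Call after each optimizer step: shadow <- mu * shadow + (1 - mu) * param.
        for n, p in model.named_parameters():
            self.shadow[n].mul_(self.mu).add_(p.detach(), alpha=1.0 - self.mu)

    def ema_copy(self, model):
        # Call at evaluation/sampling time: a copy of the model with smoothed weights.
        ema_model = copy.deepcopy(model)
        for n, p in ema_model.named_parameters():
            p.data.copy_(self.shadow[n])
        return ema_model

So, under this reading, the training model itself is never smoothed in place; the EMA weights live in the shadow dictionary until they are explicitly copied out for evaluation or sampling.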
