wassname / attentive-neural-processes Goto Github PK

View Code? Open in Web Editor NEW

86.0 7.0 23.0 89.07 MB

implementing "recurrent attentive neural processes" to forecast power usage (w. LSTM baseline, MCDropout)

License: Apache License 2.0

Jupyter Notebook 98.74% Python 1.26%

attentive-neural-processes uncertainty prediction pytorch attention neural-processes anp anp-rnn rnn

attentive-neural-processes's People

Contributors

Stargazers

Watchers

attentive-neural-processes's Issues

ANP-RNN 'use_deterministic_path'= False?

Hi there,

I am just wondering if you forgot to set 'use_deterministic_path'= True for ANP-RNN, since in the paper the authors indicate so, and you obviously have already set cross-validations.

https://github.com/3springs/attentive-neural-processes/blob/af431a267bad309b2d5698f25551986e2c4e7815/neural_processes/models/neural_process/lightning.py#L189-L195

BTW, I was trying to replicate the experiment on GP dataset, and I have already implemented an ANP. Although the architecture of mine is slightly different from yours, I assume I will just need to replace all the MLPs with LSTMs plus sequential encodings of the input, output, right? However, mine result was really bad, when I sorted my x_context, x_target, the model only seems to be able to predict a few points and predicts flat curves afterwards. Could you please share with me any hints regarding this?

Your help is very much appreciated

Missing anp-rnn_1d_regression.ipynb

Hi,

Thank you for such a great and clean implementation.

It seems that anp-rnn_1d_regression.ipynb is missing from the repo. It is mentioned in the Usage section in your Readme.

https://github.com/3springs/attentive-neural-processes/blob/master/anp-rnn_1d_regression.ipynb

Regards & thanks
Kapil

1d regression example

I was trying to run the 1d regression notebook straight out of the box as provided. Ran into this issue of size mismatch. Any help would be appreciated. Thank you

ValueError Traceback (most recent call last)

in ()
32 optim.zero_grad()
33 y_pred, kl, loss, mse_loss, y_std = model(context_x, context_y, target_x,
---> 34 target_y)
35 loss.backward()
36 optim.step()

ValueError: not enough values to unpack (expected 5, got 3)

No TensorBoard logs from smartmeters-ANP-RNN[-mcdropout].ipynb

Thanks for the awesome repo! I ran the notebooks smartmeters-ANP-RNN.ipynb and smartmeters-ANP-RNN-mcdropout.ipynb which instructed to run tensorboard --logdir ${MODEL_DIR} but there were no records found.

I tried replacing DictLogger with the vanilla TensorBoardLogger but this didn't change anything I could see.
There were no output .tfevents in MODEL_DIR (only a model checkpoint).

Still `anp-rnn_1d_regression.ipynb` logged to TensorBoard fine, using `SummaryWriter` directly:

Although not a bug, I was also wondering why the training looked unstable from these plots :) The ANN-RNP paper reported pretty stable convergence:

Thanks a lot for your time! I'll report back here if I find anything new.

Fixing the error in kl_loss_var function

https://github.com/3springs/attentive-neural-processes/blob/016272a077a19bc51d145d1ad99d910477458876/neural_processes/utils.py#L167

There is an issue with the computation of kl-divergence when using kl_loss_var function. I think the fix would be by removing the second ( before var_ratio_log.exp(). The update would look like:

def kl_loss_var(prior_mu, log_var_prior, post_mu, log_var_post):
    var_ratio_log = log_var_post - log_var_prior
    kl_div = (
         var_ratio_log.exp() + ((post_mu - prior_mu) ** 2) / log_var_prior.exp()
        - 1.0
        - var_ratio_log
       )
    kl_div = 0.5 * kl_div

Otherwise, using torch.distributions.kl_divergence(z_post_dist, z_prior_dist) where

z_prior_dist =  torch.distributions.normal.Normal(mu_c, sigma_c) # mu_c, sigma_c are the computed mean and standard deviation using contexts
z_post_dist =  torch.distributions.normal.Normal(mu_t, sigma_t) # mu_t, sigma_t are the computed mean and standard deviation using targets

would do the job.

PS. thank you for this open-source implementation! 👍

wassname / attentive-neural-processes Goto Github PK

attentive-neural-processes's People

Contributors

Stargazers

Watchers

Forkers

attentive-neural-processes's Issues

ANP-RNN 'use_deterministic_path'= False?

Missing anp-rnn_1d_regression.ipynb

1d regression example

No TensorBoard logs from smartmeters-ANP-RNN[-mcdropout].ipynb

Still `anp-rnn_1d_regression.ipynb` logged to TensorBoard fine, using `SummaryWriter` directly:

Fixing the error in kl_loss_var function

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

wassname / attentive-neural-processes Goto Github PK

attentive-neural-processes's People

Contributors

Stargazers

Watchers

Forkers

attentive-neural-processes's Issues

Still anp-rnn_1d_regression.ipynb logged to TensorBoard fine, using SummaryWriter directly:

Recommend Projects

Recommend Topics

Recommend Org

Still `anp-rnn_1d_regression.ipynb` logged to TensorBoard fine, using `SummaryWriter` directly: