
deep-ensembles's Introduction

deep-ensembles

Reproduction of the paper: Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

deep-ensembles's People

Contributors

mpritzkoleit


deep-ensembles's Issues

reproducing Deep Ensembles paper

Hi,
I was looking for an implementation of Deep Ensembles and found your notebook. Did you ever find out why you couldn't reproduce some of the results from the Deep Ensembles paper?
Thanks!

About the questions at the end of the ipynb file

Hi, I am a PhD student in South Korea and I am actively using your deep ensemble Python code.

While using your code, I think I can answer the questions at the end of the ipynb file.

Q

  1. Is something wrong with the training procedure in terms of batch size or epochs? The paper mentions that a batch size of 100 was used, but the toy example has only 20 samples. Training on these 20 samples for 40 epochs leads to underfitting with the NLL loss.
    --> Algorithm 1 in the original paper carries the comment "single n_m for clarity, minibatch in practice".
    So for the regression on the toy dataset, I think a minibatch size of 1 should be used.
    I think that will solve the underfitting issue.

  2. Is something wrong with the NLL loss function?
    --> I think your "def NLLloss(y, mean, var):" is correct.

  3. Is something wrong with the standard initialization in PyTorch? From Figure 1 (right) in the paper, I would argue that each network in the ensemble has a more strongly varying output outside the interval [-4, 4] than is the case in my reproduction. I can't reproduce the predictive uncertainty seen in the figure above.
    --> Figure 1 of the original paper states that the confidence interval was drawn with 3 standard deviations.
    However, in your code the plot draws the confidence interval with 1 standard deviation.
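
To make points 2 and 3 concrete, here is a minimal, framework-free Python sketch. It is not the notebook's code: the function names `gaussian_nll`, `ensemble_moments`, and `confidence_band` are made up for illustration, and it works on plain floats rather than PyTorch tensors. It assumes the Gaussian NLL from the paper (dropping the additive constant) and the uniform-mixture rule for combining the ensemble members' means and variances.

```python
import math


def gaussian_nll(y, mean, var):
    """Gaussian negative log-likelihood for one sample, without the constant term."""
    return 0.5 * math.log(var) + 0.5 * (y - mean) ** 2 / var


def ensemble_moments(means, variances):
    """Mean and variance of a uniformly weighted mixture of Gaussians.

    means, variances: one predicted mean and variance per ensemble member.
    """
    m = len(means)
    mix_mean = sum(means) / m
    # Mixture variance: average of (var + mean^2) minus the squared mixture mean.
    mix_var = sum(v + mu ** 2 for mu, v in zip(means, variances)) / m - mix_mean ** 2
    return mix_mean, mix_var


def confidence_band(mean, var, k=3.0):
    """Lower and upper bound at k standard deviations around the mean."""
    std = math.sqrt(var)
    return mean - k * std, mean + k * std


if __name__ == "__main__":
    # Two hypothetical ensemble members predicting at a single input point.
    mean, var = ensemble_moments([1.0, 3.0], [1.0, 1.0])
    print("1 std band:", confidence_band(mean, var, k=1.0))
    print("3 std band:", confidence_band(mean, var, k=3.0))
```

The band grows linearly with `k`, so a plot drawn at 1 standard deviation will look three times narrower than one drawn at 3 standard deviations, even when the underlying predictions are identical.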
