zchuning / repo Goto Github PK

View Code? Open in Web Editor NEW

12.0 12.0 1.0 28.65 MB

Resilient Model-Based RL by Regularizing Posterior Predictability

Home Page: https://zchuning.github.io/repo-website/

Python 100.00%

deep-learning reinforcement-learning robotics

repo's People

Contributors

Stargazers

Watchers

repo's Issues

Hope to get the performance data on the standard DMC

Hello, I want to do a comparison on standard DMC, but I don't know the hyperparameters of the standard task. Could you please provide the final convergence performance of standard DMC, which is the performance shown in Figure 11 in the appendix of the paper?

Questions regarding DMC

Hi, thank you for publishing awesome work with the code. This is very helpful.

I have two questions regard DMC experiments.

I wonder what X-axis in Figure 5 means. Does it mean 'total environment steps with action repeat of 2 counted' or 'a number of env.step() call'? Based on the implementation, I assume it is the latter but I would like to make sure.
Could you make the performances used for creating plots (specifically Figure 5 and Figure 7) available? This would be helpful to compare RePo with other methods.

Again, thank you so much!

About performance?

Hello, I ran the distraction DMC experiment. The hyperparameters were set to Table 2 in Appendix C, but the performance was much worse than what was shown in the paper, and the variance was large. Why is this? Are there other settings that need attention?

What is the difference between 'pair' and 'simple_pair'

Hi, I'm interested in this work. I find that 'calibration_mode' has two forms, 'pair' and 'simple_pair', and the default is 'simple_pair'. What's the difference between them?

And I find that calibration loss in code seems to be a cross-entropy loss, rather than the L2 loss mentioned in the paper. Why do this? Will it improve performance?

About reconstruction?

Hi, I have another question. As mentioned in the title of the 4th section: "RePo: Parsimonious Representation Learning without Reconstruction", but in the repo.py, there is a "# Reconstruction loss for probing", why is that？

zchuning / repo Goto Github PK

repo's People

Contributors

Stargazers

Watchers

repo's Issues

Hope to get the performance data on the standard DMC

Questions regarding DMC

About performance?

What is the difference between 'pair' and 'simple_pair'

About reconstruction?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent