Code Monkey home page Code Monkey logo

repo's People

Contributors

zchuning avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

repo's Issues

Hope to get the performance data on the standard DMC

Hello, I want to do a comparison on standard DMC, but I don't know the hyperparameters of the standard task. Could you please provide the final convergence performance of standard DMC, which is the performance shown in Figure 11 in the appendix of the paper?

Questions regarding DMC

Hi, thank you for publishing awesome work with the code. This is very helpful.

I have two questions regard DMC experiments.

  1. I wonder what X-axis in Figure 5 means. Does it mean 'total environment steps with action repeat of 2 counted' or 'a number of env.step() call'? Based on the implementation, I assume it is the latter but I would like to make sure.
  2. Could you make the performances used for creating plots (specifically Figure 5 and Figure 7) available? This would be helpful to compare RePo with other methods.

Again, thank you so much!

About performance?

Hello, I ran the distraction DMC experiment. The hyperparameters were set to Table 2 in Appendix C, but the performance was much worse than what was shown in the paper, and the variance was large. Why is this? Are there other settings that need attention?
Cartpole_swingup
hopper_stand
cheetah_run
wlker_run
walker_walk
walker_stand

What is the difference between 'pair' and 'simple_pair'

Hi, I'm interested in this work. I find that 'calibration_mode' has two forms, 'pair' and 'simple_pair', and the default is 'simple_pair'. What's the difference between them?

And I find that calibration loss in code seems to be a cross-entropy loss, rather than the L2 loss mentioned in the paper. Why do this? Will it improve performance?

About reconstruction?

Hi, I have another question. As mentioned in the title of the 4th section: "RePo: Parsimonious Representation Learning without Reconstruction", but in the repo.py, there is a "# Reconstruction loss for probing", why is that?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.