bremen's Issues

Performance of cheetah-run task

Hi,

I tried to run deployment-efficient experiments to reproduce the results reported in the paper with the following command:

python recursive.py --env cheetah_run --exp_name recursive_example --sub_exp_name BREMEN_demo --param_path configs/params_cheetah_run_offline.json --bc_init --random_seeds 0 --target_kl 0.1 --max_path_length 250 --gaussian 0.1 --const_sampling

After training finished, I observed the following evaluation results:

---------------------------
| Iteration    | 399      |
| TotalSamples | 850000   |
| episode_max  | 745      |
| episode_mean | 741      |
| episode_min  | 735      |
---------------------------

However, according to Fig. 2 in the original paper (https://arxiv.org/pdf/2006.03647.pdf), the performance on HalfCheetah should be around 6000, far above the episode mean of 741 that I observed.

Are the parameter settings in the command above the same as those used for the experiments in the paper? If not, could you let me know which hyperparameters should be changed to reproduce the reported results? Or could there be another reason for this performance gap?

Thanks a lot!

Run BREMEN on D4RL

Hi. Thanks for sharing the code. I am interested in offline reinforcement learning. In Appendix D of the paper, you show the performance of BREMEN on D4RL, but I could not find the launch script in the codebase. Do you plan to share the script for launching the D4RL experiments?

FetchReach-v0

Hello Authors!

Thanks so much for releasing this wonderful codebase. I have looked at your latest ICLR work and was wondering whether your code includes an implementation for FetchReach-v0. If not, would it be possible to add that part as well?

Thanks!
Candy
