Code Monkey home page Code Monkey logo

Comments (8)

davide97l avatar davide97l commented on July 21, 2024

Hello @hlsafin, first of all you should make sure to train your agent for enough training steps in order to see meaningful results. For example, using our default configuration, R2D2 takes about 4M steps to converge on Pong and 10M to reach good performance on Spaceinvaders. Moreover, there are two important h-paramerers you may want to tune for different environments:

  • burnin_step: how many steps to use for initializing RNN hidden state, since those steps are not used for training, you could reduce them in those envs where initial steps are the most important.
  • learn_unroll_len: length of a trajectory, you could increase this value when having a longer memory is critical.

You can refer to this file to see all the h-parameters that can be tuned: https://github.com/opendilab/DI-engine/blob/main/dizoo/atari/config/serial/pong/pong_r2d2_config.py

Following, I'm going to attach our training log relative to the Pong environment so that you can make a comparison.
image

from di-engine.

hlsafin avatar hlsafin commented on July 21, 2024

image
Yeah, that's so weird, I am running the same file as you, but getting different results. Hmm

from di-engine.

hlsafin avatar hlsafin commented on July 21, 2024

at 2.4 million, the mean reward is still around -20

from di-engine.

PaParaZz1 avatar PaParaZz1 commented on July 21, 2024

can you upload your tensorboard event?

from di-engine.

hlsafin avatar hlsafin commented on July 21, 2024

yeah, I don't think I can upload files here. Maybe I can email you? I tried oppo_config, and it converged. Just no luck with r2d2 config.py

from di-engine.

PaParaZz1 avatar PaParaZz1 commented on July 21, 2024

yeah, I don't think I can upload files here. Maybe I can email you? I tried oppo_config, and it converged. Just no luck with r2d2 config.py

You can send email to our email or contact us on slack channel

from di-engine.

hlsafin avatar hlsafin commented on July 21, 2024

yeah, I can't seem to find it. When you ran your config, did you make any changes to the config file? Also, which version are you using?

from di-engine.

PaParaZz1 avatar PaParaZz1 commented on July 21, 2024

slack channel link: https://join.slack.com/t/opendilab/shared_invite/zt-v9tmv4fp-nUBAQEH1_Kuyu_q4plBssQ

from di-engine.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.