Code Monkey home page Code Monkey logo

Comments (2)

Feng-Kaijun avatar Feng-Kaijun commented on June 5, 2024

Hi, I had finished the training of ga3c-cadrl whit TrainPhase1 and TrainPhase2, but i don't understand how to test this policy, in order to fork your experience which is provided in the TABLE 1 of your latest paper (Collision avoidance in pedestrian-rich environments with deep reinforcement learning). And i am working on this survey based on your great work, especially the gym-collision-avoidance environment. Looking forward to your answers!

from rl_collision_avoidance.

mfe7 avatar mfe7 commented on June 5, 2024

Thanks! That is great you were able to train some new policies. There is some capability to run the trained policy in random scenarios within this repo, but I don't remember exactly how, and it sounds like you'd like to go beyond that anyway.

To test the policy as I did in the paper you referred to, I would suggest using the gym-collision-avoidance repo on its own and run this bash script (i.e., clone a fresh copy of the gym repo in a new place, since the submodule gym env within this repo is probably not as up-to-date).

That bash script should automatically run a bunch of random test scenarios for various policies and numbers of agents, based on this config. You could edit self.POLICIES_TO_TEST to add a new policy key (e.g., GA3C-CADRL-Feng), and then add the corresponding key/value to this dict with the checkpoint path etc. To start, maybe you'll want to simply make the new policy the only element in self.POLICIES_TO_TEST.

This bash script should: for each number of agents, for each policy_to_test, run the same N pre-defined random test scenarios. Each test scenario will contain all agents running the same policy_to_test. Note that this config only runs 4 test cases by default, which is probably good to start with so you can be sure it's set up correctly and logs the results, but eventually increase this number (I think 500 was used in the paper?). If things are working properly, this bash script should generate a bunch of png files of the agent trajectories and also log some pkl files with results/stats of each test episode. I believe this script is what I used to generate the numbers in the table once I had run the experiments -- running 500 experiments for each policy and each number of agents took a while (hours). Unfortunately, this last script seems to have some hard-coded paths and I am not sure it will work right away, but maybe it can be used as a guide.

Please let me know if this works or if you run into other issues!

from rl_collision_avoidance.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.