Comments (2)
Hi, I had finished the training of ga3c-cadrl whit TrainPhase1 and TrainPhase2, but i don't understand how to test this policy, in order to fork your experience which is provided in the TABLE 1 of your latest paper (Collision avoidance in pedestrian-rich environments with deep reinforcement learning). And i am working on this survey based on your great work, especially the gym-collision-avoidance environment. Looking forward to your answers!
from rl_collision_avoidance.
Thanks! That is great you were able to train some new policies. There is some capability to run the trained policy in random scenarios within this repo, but I don't remember exactly how, and it sounds like you'd like to go beyond that anyway.
To test the policy as I did in the paper you referred to, I would suggest using the gym-collision-avoidance repo on its own and run this bash script (i.e., clone a fresh copy of the gym repo in a new place, since the submodule gym env within this repo is probably not as up-to-date).
That bash script should automatically run a bunch of random test scenarios for various policies and numbers of agents, based on this config. You could edit self.POLICIES_TO_TEST
to add a new policy key (e.g., GA3C-CADRL-Feng
), and then add the corresponding key/value to this dict with the checkpoint path etc. To start, maybe you'll want to simply make the new policy the only element in self.POLICIES_TO_TEST
.
This bash script should: for each number of agents, for each policy_to_test, run the same N pre-defined random test scenarios. Each test scenario will contain all agents running the same policy_to_test. Note that this config only runs 4 test cases by default, which is probably good to start with so you can be sure it's set up correctly and logs the results, but eventually increase this number (I think 500 was used in the paper?). If things are working properly, this bash script should generate a bunch of png
files of the agent trajectories and also log some pkl
files with results/stats of each test episode. I believe this script is what I used to generate the numbers in the table once I had run the experiments -- running 500 experiments for each policy and each number of agents took a while (hours). Unfortunately, this last script seems to have some hard-coded paths and I am not sure it will work right away, but maybe it can be used as a guide.
Please let me know if this works or if you run into other issues!
from rl_collision_avoidance.
Related Issues (20)
- Modifying State Vectors HOT 3
- ./train.sh TrainPhase1. tensorflow.python.framework.errors_impl.DataLossError: not an sstable (bad magic number) HOT 2
- Modifying State Vectors & collect_regression_dataset HOT 6
- collision_avoidance_env.py line 356 HOT 3
- Question about how to train code as single agent HOT 4
- SA-CADRL HOT 2
- Cannot execute ./install.sh HOT 4
- Debug HOT 1
- Question about how to make Learning cadrl! HOT 4
- Question about umber of episodes and fix one mistake in code! HOT 2
- Not able to train the model on GPU HOT 2
- Issue with downloading Regression dataset HOT 5
- Issue with RL Phase 2 Training HOT 1
- Issue including GA3C-CADRL agents in training environment HOT 3
- Test Cases in FullTestSuite HOT 2
- ./train.sh TrainPhase1`
- about checkpoints HOT 4
- tensorflow/core/util/events_writer.cc:108] Write failed because file could not be opened. HOT 4
- Visualization HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rl_collision_avoidance.