The aim of this project is to reproduce reinforcement learning algorithms. While there are a lot of repositories that attempt to do the same, many of them may use simpler benchmarks or not achieve results that are as good as the ones reported in the papers. An additional goal is to provide a modular code base that is quite readable and reusable.
Currently implemented algorithms:
- DQN both NIPS and Nature
- QR-DQN-0,QR-DQN-1
- A3C primarily based on universe-starter-agent
- PPO for recurrent and feed-forward networks
To view some of the result head over to the wiki page.
-
Install tensorflow or tensorflow-gpu (version >= 1.7.0). Tensorflow installation instructions.
-
Clone the repository:
git clone https://github.com/MichaelKonobeev/rl.git
cd
into the package and install it:
cd rl && pip install -e .