davinellulinvega / openairl Goto Github PK
View Code? Open in Web Editor NEWToy examples of using Q-learning and actor-critic methods implemented in deep neural networks for solving the inverse pendulum task.
License: GNU General Public License v3.0