jainammm / reinforcement-learning-openai-gym Goto Github PK
View Code? Open in Web Editor NEWImplementation of DP based policy iteration, value iteration and Q-learning algorithm on taxi_v3 environment of Gym toolkit. Implementation of value function approximation based Q-learning algorithm for for the mountain car and cart-pole environments of gym.