michaelfish199 / supermariobros-reinforcementlearning
This project implements an agent that plays the SuperMarioBros game using the Proximal Policy Optimization (PPO) algorithm from the Stable-Baselines3 (`stable_baselines3`) library. The agent is trained to choose, at each step of the game, the action that helps it complete the level and maximize its score.
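For readers unfamiliar with PPO, the core idea can be sketched in a few lines: the policy update is driven by a clipped surrogate objective that limits how far the new policy can move from the old one in a single update. The snippet below is a minimal pure-Python illustration of that objective, not the project's training code. The function names are hypothetical, and the clip range of 0.2 is an assumption (it matches the Stable-Baselines3 default, but the value this project uses is not stated).

```python
from typing import Sequence


def clipped_surrogate(ratio: float, advantage: float, clip_range: float = 0.2) -> float:
    """Per-sample PPO clipped objective.

    ratio      -- pi_new(a|s) / pi_old(a|s) for the sampled action
    advantage  -- estimated advantage of that action
    clip_range -- epsilon; 0.2 is an assumed value, not taken from the project
    """
    # Restrict the probability ratio to [1 - eps, 1 + eps].
    clipped_ratio = max(1.0 - clip_range, min(ratio, 1.0 + clip_range))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


def mean_clipped_objective(
    ratios: Sequence[float],
    advantages: Sequence[float],
    clip_range: float = 0.2,
) -> float:
    """Average the per-sample objective over a batch (the quantity PPO maximizes)."""
    values = [clipped_surrogate(r, a, clip_range) for r, a in zip(ratios, advantages)]
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical batch: three sampled actions with their ratios and advantages.
    print(mean_clipped_objective([1.5, 0.5, 1.0], [2.0, -1.0, 3.0]))
```

With a positive advantage, a ratio above `1 + clip_range` earns no extra credit, so the update has no incentive to push the action's probability further. With a negative advantage, the same holds once the ratio falls below `1 - clip_range`.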