iyanuoluwa-vic / learning-paths-using-reinforcement-learning-alphazero
An agent-based system that, over several iterations, explores and learns an initially unknown environment: a five-by-five grid made up of three types of spaces (normal locations, pickup locations, and drop-off locations).
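To make the setup concrete, here is a minimal sketch of such a grid and an agent that discovers it over repeated iterations. It is not the repository's code: the pickup and drop-off coordinates, the names `Cell`, `cell_type`, `step`, and `explore`, and the use of a plain random walk in place of a learned policy are all assumptions made for illustration.

```python
import random
from enum import Enum


class Cell(Enum):
    NORMAL = "normal"
    PICKUP = "pickup"
    DROPOFF = "dropoff"


SIZE = 5  # five-by-five grid, as stated in the description

# Hypothetical layout: these coordinates are placeholders, not the repository's.
PICKUPS = {(0, 0), (2, 2)}
DROPOFFS = {(0, 4), (4, 4)}

# Up, down, left, right as (row, column) offsets.
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def cell_type(pos):
    """Return the type of the cell at pos under the assumed layout."""
    if pos in PICKUPS:
        return Cell.PICKUP
    if pos in DROPOFFS:
        return Cell.DROPOFF
    return Cell.NORMAL


def step(pos, move):
    """Apply a move; the agent stays in place if the move would leave the grid."""
    row, col = pos[0] + move[0], pos[1] + move[1]
    if 0 <= row < SIZE and 0 <= col < SIZE:
        return (row, col)
    return pos


def explore(iterations, steps_per_iteration, seed=0):
    """Random-walk the grid and record the type of every cell the agent visits."""
    rng = random.Random(seed)
    known = {}  # the agent's map of the environment, initially empty
    for _ in range(iterations):
        pos = (rng.randrange(SIZE), rng.randrange(SIZE))
        known[pos] = cell_type(pos)
        for _ in range(steps_per_iteration):
            pos = step(pos, rng.choice(MOVES))
            known[pos] = cell_type(pos)
    return known


if __name__ == "__main__":
    discovered = explore(iterations=20, steps_per_iteration=30)
    print(f"cells discovered: {len(discovered)} of {SIZE * SIZE}")
```

The agent's map starts empty and fills in only for cells it has actually visited, which mirrors the "initially unknown environment" in the description. A reinforcement learning agent would replace the random choice of move with a policy updated from rewards.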