In this project, we implement the Deep Q-Network (DQN) introduced by Mnih et al. (2015) and eventually learn to play Pong. The project consists of three phases: understanding the algorithm, implementing and testing the algorithm on a simple problem (CartPole environment), and using it to learn to play Pong.
This project is part of the Reinforcement Learning course at Uppsala University.