An experiment in deep reinforcement learning, using the board game Quixo as the learning task.
The trained agent's playing strength will be evaluated in a tournament against minimax_bot, a bot that uses minimax search with improvements such as alpha-beta pruning or iterative deepening.
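
To illustrate the kind of search the opponent relies on, here is a minimal sketch of minimax (in its negamax form) with alpha-beta pruning and iterative deepening. It is not the actual minimax_bot code: the function names `negamax` and `iterative_deepening`, the callback parameters, and the toy Nim game used in place of Quixo are all assumptions made for illustration. A real bot would likely bound the search by time rather than by a fixed maximum depth and would use a Quixo-specific evaluation function.

```python
import math


def negamax(state, depth, alpha, beta, moves, apply_move, evaluate):
    """Value of `state` for the player to move, searched `depth` plies deep."""
    legal = list(moves(state))
    if depth == 0 or not legal:
        return evaluate(state)
    best = -math.inf
    for move in legal:
        # The opponent's best value, negated, is our value for this move.
        value = -negamax(apply_move(state, move), depth - 1,
                         -beta, -alpha, moves, apply_move, evaluate)
        best = max(best, value)
        alpha = max(alpha, value)
        if alpha >= beta:  # alpha-beta cutoff: the opponent will avoid this line
            break
    return best


def iterative_deepening(state, max_depth, moves, apply_move, evaluate):
    """Search depths 1..max_depth and return (best_move, best_value)."""
    best_move, best_value = None, -math.inf
    for depth in range(1, max_depth + 1):
        previous_best = best_move
        # Try the previous iteration's best move first to improve pruning.
        ordered = sorted(moves(state), key=lambda m: m != previous_best)
        best_move, best_value = None, -math.inf
        for move in ordered:
            value = -negamax(apply_move(state, move), depth - 1,
                             -math.inf, -best_value, moves, apply_move, evaluate)
            if value > best_value:
                best_move, best_value = move, value
    return best_move, best_value


# Hypothetical toy game standing in for Quixo: single-pile Nim.
# A state is the number of stones left; a move removes 1 to 3 stones;
# whoever takes the last stone wins.
def nim_moves(stones):
    return [k for k in (1, 2, 3) if k <= stones]


def nim_apply(stones, k):
    return stones - k


def nim_evaluate(stones):
    # With no stones left, the player to move has already lost.
    return -1 if stones == 0 else 0


print(iterative_deepening(5, 10, nim_moves, nim_apply, nim_evaluate))  # (1, 1)
```

With 5 stones, the search finds that taking 1 stone leaves the opponent with 4, a losing position, so the returned value is 1 (a forced win for the player to move). Plugging in Quixo would mean replacing the three `nim_*` callbacks with a move generator, a move applier, and a board evaluation for that game.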