Code Monkey home page Code Monkey logo

ttt's Introduction

ttt

An implementation of tic-tac-toe in C, featuring an AI powered by the negamax algorithm, the reinforcement learning (RL) algorithm, the Monte Carlo tree search (MCTS) algorithm. And the RL algorithm contains the Monte Carlo learning algorithm and TD learning algorithm.

Build

negamax AI

If you want to play with negamax AI: Build the game:

$ make

and run:

$ ./ttt

reinforcement learning AI

If you want to play with RL AI: Train the state value table first, you could modify the hyperparameters, which are macros in train.c:

  • MONTE_CARLO : Decide whether or not to use the Monte Carlo method or the TD method.
  • REWARD_TRADEOFF : Decide the reward of the Markov decision model, which is a balance between the episode reward and the score from get_score. Those who want to fully customize the reward can modify it in the init_agent function.
  • INITIAL_MULTIPLIER : Decide the initial state value, which is the multiplier of get_score in the initial state value. Those who want to fully customize the initial state value can modify it in the init_agent function.
  • EPSILON_GREEDY : Decide whether or not to use epsilon-greedy exploration when training.
  • NUM_EPISODE : the number of game episode in training
  • LEARNING_RATE, GAMMA : $\alpha$, $\gamma$ in training.
  • EPSILON_START EPSILON_END : $\epsilon$ in Epsilon-Greedy Algorithm and it would decay exponentially.

compile

$ make train

and run:

$./train

Build the game playing with RL agent, it would load the pretrained model from train:

$ make rl

and run:

$ ./rl

MCTS AI

If you want to play with MCTS AI:
There are several hyperparameters you can modify:

  • EXPLORATION_FACTOR in agents/mcts.h : The exploration parameter.
  • ITERATIONS in agents/mcts.h : Number of simulations in MCTS.

Build the game:

$ make mcts

and run:

$ ./mcts

Run

These program operate entirely in the terminal environment. Below is its appearance as it awaits your next move:

 1 |  ×
 2 |     ○
 3 |
---+----------
      A  B  C
>

To execute a move, enter [column][row]. For example:

> a3

Press Ctrl-C to exit.

Game Rules

The winner is determined by the first player who successfully places GOAL of their marks in a row, whether it is vertically, horizontally, or diagonally, regardless of the board size.

Using the following 4x4 board games, whose GOAL is 3, as examples:

 1 |  ×  ×
 2 |     ○  ×
 3 |     ○
 4 |     ○
---+------------
      A  B  C  D
>

The player "○" wins the game since he placed his marks in a row vertically (B2-B3-B4).

 1 |  ×  ×  ○
 2 |  ×  ○  
 3 |  ○  
 4 |     
---+------------
      A  B  C  D
>

The player "○" wins the game since he placed his marks in a row diagonally (A3-B2-C1).

 1 |  o  x  
 2 |  o  x  
 3 |  o     x
 4 |  o  x
---+------------
      A  B  C  D
>

The player "○" wins the game if ALLOW_EXCEED is 1; otherwise, the game will continue because the number of "○"s in a row exceeds GOAL.

Reference

ttt's People

Contributors

freshliver avatar hotmercury avatar idoleat avatar jouae avatar jserv avatar kaiweike avatar lumynou5 avatar ndsl7109256 avatar otteryc avatar p96114175 avatar paul90317 avatar visitorckw avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.