Light

agrawal-rohit / tic-tac-toe-ai-bots Goto Github PK

View Code? Open in Web Editor NEW

32.0 2.0 56.0 45 KB

AI bots playing Tic Tac Toe

Python 100.00%

games minimax-algorithm reinforcement-learning temporal-differencing-learning tic-tac-toe tic-tac-toe-game tictactoe tictactoe-game

tic-tac-toe-ai-bots's Introduction

A Tic Tac Toe Bot

This is the code repository for my article on Medium - Playing Games with Python - Tic Tac Toe, where I have tried to take the famous Tic-Tac-Toe game and create a bot proficient enough to beat human players, if not the game itself.

What's inside this repo?

A fully playable Tic-Tac-Toe environment.
A bot trained using Temporal Difference learning (A technique in Reinforcement learning).
A bot trained using the Minimax Algorithm.

How to use

Play against the RL bot

run python testing_(HumanvsAI)_ReinforcementLearning.py

Play against the Minimax bot

run python HumanvsAI_Minimax.py

Play against another human player (Regular tic tac toe)

run python HumanvsHuman.py

Training the RL Bot

Out of the two implementations, only the RL bot needs to train in order to reach proficiency. It does so by play 1v1 with another RL bot sharing the same state values in order to learn to beat itself and eventually become better. The num_iterations parameter controls the number of games that will be played among the bots.

run python training_(AIvsAI)_ReinforcementLearning.py

Testing the two bots by making them play among themselves

I wrote anotherr script in order to see which bot performed better in very brutal 1v1 fashion. The num_iterations parameter controls the number of games that will be played among both the bots.

run python Showdown_Minimax_vs_RL.py

To Try

Minimax Algorithm
Temporal Difference Learning
Q Learning
Genetic Algorithms

tic-tac-toe-ai-bots's People

Contributors

Stargazers

Watchers

Forkers

romanbryzhchuk iaamar lejames12142 lucvanacker devakisj1 27guptamohit strawhatrick per-stian mrhktriot 9raven leo031938 rmskannan proob aayushsingla bsmurfy 11fisher andychen66523 wenseslaus qqqyyy111 anmol221b gxxxxxxxx nerinepomuceno sreebhargavi byeruva diznet munka21 rubimen-py webapplicationkhthth anh56 necopottsun karmiksan hbcbh1999 jl276 aruncsk degrutto adityasawant0912 sid6595 raghavk16 lancer081 jordandiazfr bananacoder404 isacolak kirtivardhan99 aditidatta aifictionfact mahimagarg17 bbsvip sarankoundinya2000 harmonyel yangboz xinthedark mhhbrasse avpromo aviptl101 rbtkswlf kxwang-max

tic-tac-toe-ai-bots's Issues

Found (and solved) error in your Minimax

Hi

I was surprised to see that Minimax loose from RL.
So I checked your code. And I found that the return value (type) of getBestMove(state, player) is not (consistently) correct.
The function returns a "best move" ( return best_move ), but in the beginning of the function it may return a "-1", "1", or "0", which is the best score (instead of a move).
Hopefully you can fix the issue with this information.

Update: I fixed your code, and now Minimax is even better than RL, and it never looses from a human player :)
Please check it out. The trick is to return both the score and the move as a tuple in the function getBestMove.

Regards

Marco Brassé
The Netherlands

UPDATED CODE (with also some small edits of "==" instead of "is")
See
HumanvsAI_Minimax.zip

issues

its showing , invalid syntax at line 211.could you please resolve this issue

regarding .txt files

how are you getting those values in .txt values. are they random or is there any procedure to get them

REGARDING trained_state _values_0.txt file

from where i can download this file, trained_state _values_0.txt

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.