AI-Sudoku-Solver

Solving Sudoku Puzzles With Computer Vision And Neural Networks

Solving Sudoku Puzzle Using Neural Network

The classic sudoku is a number placing puzzle game with a grid of 9 rows and 9 columns, partly filled with numbers 1..9 . We have to fill up the remaining positions such that each row, columns and 3x3 sub grids contains numbers 1..9, without repeatation.

Here our input is an image of sudoku puzzle and we need to produce a corresponding output image by filling the remaining positions of the input. The pipeline for the solution consists of the following steps.

Preprocess the input image and remove the background
Crop ROI's containing digits from the grid
Predict numbers from image crops using neural network
Predict solution using neural network in an iterative manner
Verify the solution and plot the resuts on the input image

We use tensorflow-keras library for training(prediction) the neural network and opencv library for image processing. The input sudoku puzzles are assumed to be images of printed version of the puzzle.

Training Datasets

The digit recognition model was trained using the entire SVHN dataset(train, test and extra) in grayscale mode. It is used to classify digits 0 to 9. The sudoku solver model was trained using a dataset of 10 million puzzles. The inputs for this model contains 9x9 arrays of integers representing the puzzles, such that zeros represent the unfilled positions.

The numpy dataset used for training was created by combining the following two datasets in csv formats.

Sudoku Solver Algorithm

A single iteration of the model, as such does not seem to produce correct results for all the positions in the input. So, we follow a iterative approach of feeding the partial solution of one iteration as input to next iteration.

The input is a sudoku matrix of 9x9 with numbers 0...9 as input(i.e 'puzzle').
Zeros represents the blank spaces in the original puzzle.
Each iteration produces an output array of 9x9 with numbers 1...9 (i.e 'out').
For each such output array, 'maxp'(9x9) contains the corresponding probability values.
For each filled(non-zero) element in input array we set corresponding probability in 'maxp' o -1.
Now, find the maximum element(single) in 'maxp' and set the corresponding position of input with corresponding values from current output.
Repeat the iterations with modified input(i.e 'puzzle'), until all elements are filled (ie. no zeros).

The algoritm takes N iterations for solving the entire puzzle, where N represenets the number of unfilled positions.

Digit Recognition Inputs

The input puzzle should be a grayscale or rgb image.
The images should not be blurry or shaky.
It should be a close-up image of the puzzle from a flat surface.
The puzzle should be in printed format eg.: paper or screen
The puzzle image should not contain marks, stains or unnecessary patterns.

Sudoku Models

Digit Recognition

The model was trained using Adam optimizer with a learning rate 0.001 ~ 0.000001 and SCCE loss function.

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d_4 (Conv2D)            (None, 30, 30, 32)        320       
_________________________________________________________________
conv2d_5 (Conv2D)            (None, 28, 28, 64)        18496     
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 14, 14, 64)        0         
_________________________________________________________________
dropout_4 (Dropout)          (None, 14, 14, 64)        0         
_________________________________________________________________
flatten_2 (Flatten)          (None, 12544)             0         
_________________________________________________________________
dense_4 (Dense)              (None, 128)               1605760   
_________________________________________________________________
dropout_5 (Dropout)          (None, 128)               0         
_________________________________________________________________
dense_5 (Dense)              (None, 10)                1290      
=================================================================
Total params: 1,625,866
Trainable params: 1,625,866
Non-trainable params: 0
_________________________________________________________________

Loss: 0.14, Accuracy: 96%
Epochs: 196, Size: 19.6MB

Sudoku Solver

A heavy dense and heavy conv model was trained using the the same dataset. The following sections shows the overall summary of the model and their training results.

a) Dense model

The model was trained using Adam optimizer with a learning rate 0.001 ~ 0.000001 and SCCE loss function. Here, most of the parameters are contributed by dense layers and conv layers are light weight.

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d (Conv2D)              (None, 9, 9, 81)          810       
_________________________________________________________________
batch_normalization (BatchNo (None, 9, 9, 81)          324       
_________________________________________________________________
p_re_lu (PReLU)              (None, 9, 9, 81)          6561      
_________________________________________________________________
conv2d_1 (Conv2D)            (None, 9, 9, 81)          59130     
_________________________________________________________________
batch_normalization_1 (Batch (None, 9, 9, 81)          324       
_________________________________________________________________
p_re_lu_1 (PReLU)            (None, 9, 9, 81)          6561      
_________________________________________________________________
conv2d_2 (Conv2D)            (None, 9, 9, 81)          59130     
_________________________________________________________________
batch_normalization_2 (Batch (None, 9, 9, 81)          324       
_________________________________________________________________
p_re_lu_2 (PReLU)            (None, 9, 9, 81)          6561      
_________________________________________________________________
conv2d_3 (Conv2D)            (None, 9, 9, 162)         13284     
_________________________________________________________________
batch_normalization_3 (Batch (None, 9, 9, 162)         648       
_________________________________________________________________
p_re_lu_3 (PReLU)            (None, 9, 9, 162)         13122     
_________________________________________________________________
flatten (Flatten)            (None, 13122)             0         
_________________________________________________________________
dense (Dense)                (None, 1458)              19133334  
_________________________________________________________________
p_re_lu_4 (PReLU)            (None, 1458)              1458      
_________________________________________________________________
dense_1 (Dense)              (None, 729)               1063611   
_________________________________________________________________
reshape (Reshape)            (None, 9, 9, 9)           0         
_________________________________________________________________
softmax (Softmax)            (None, 9, 9, 9)           0         
=================================================================
Total params: 20,365,182
Trainable params: 20,364,372
Non-trainable params: 810
_________________________________________________________________

Loss: 0.24, Accuracy: 90%
Epochs: 245, Size: 244.5MB

b) Conv Model

The model was trained using Adam optimizer with a learning rate 0.001 and SCCE loss function. Here, there are no dense layers and conv layers are heavy(filters).

Layer (type)                 Output Shape              Param #   
=================================================================
conv2d (Conv2D)              (None, 9, 9, 512)         5120      
_________________________________________________________________
batch_normalization (BatchNo (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu (ReLU)                 (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_1 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_1 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_1 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_2 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_2 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_2 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_3 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_3 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_3 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_4 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_4 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_4 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_5 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_5 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_5 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_6 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_6 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_6 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_7 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_7 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_7 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_8 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_8 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_8 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_9 (Conv2D)            (None, 9, 9, 512)         2359808   
_________________________________________________________________
batch_normalization_9 (Batch (None, 9, 9, 512)         2048      
_________________________________________________________________
re_lu_9 (ReLU)               (None, 9, 9, 512)         0         
_________________________________________________________________
conv2d_10 (Conv2D)           (None, 9, 9, 9)           4617      
=================================================================
Total params: 21,268,489
Trainable params: 21,258,249
Non-trainable params: 10,240
_________________________________________________________________

Loss: 0.10, Accuracy: 96%
Epochs: 20, Size: 255.3MB

ashiishkaushal / ai_sudoku Goto Github PK

ai_sudoku's Introduction

AI-Sudoku-Solver

Training Datasets

Sudoku Solver Algorithm

Digit Recognition Inputs

Sudoku Models

References

ai_sudoku's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent