Visual Sudoku Solver

In this repo, a sudoku solver is designed to solve directly from images (Visual Sudoku). Recurrent Relational Networks are used for the task of solving the sudoku.

Task

We need to make a model which uses takes in an input sudoku board made of handwritten digits and outputs the the solved sudoku board in symbolic form (in the form of digits on a computer). For training purposes we are given pairs of visual sudoku boards as follows

(unsolved) (solved)

(Note that these are 8*8 sudoku boards where each column, each row and each bloack of size 2*4 (long side along x axis) is filled with 8 unique digits in the solution)

For converting individual handwritten cells to symbolic data, we can use all the training images, extract all sub-images from it which (8*8=64 images from one sudoku board image) and then do clustering/classification techniques. For (semi) supervision, we can use 1 labeled image from each class which is given separately. We use a combined technique of kmeans+Unsupervised Data augmentation(UDA) to make a classifier which gives 95%+ accuracy using just these 9 labeled images and a larger set of unlabeled images

labelled image (1 per class, class 0 to 8 from left to right)

Running the solver

Without Joint training

Performing Unsupervised clustering (using UDA) then using the classifier made in the UDA step to convert visual sudoku boards into symbolic boards (will have some noise) and then training the RRN on these input-output symbolic sudoku boards. Noise in labels limits the ability of the RRN to learn the rules of sudoku

run_solver.sh <path_to_train> <path_to_test_query> <path_to_sample_imgs> <path_to_out_csv>

With Joint training

Similar to the earlier part but this time, the classifier that we get from UDA is fine tuned while training the RRN. ie The pretrained classifier and RRN are trained jointly so that both improve each other

run_solver.sh <path_to_train> <path_to_test_query> <path_to_sample_imgs> <path_to_out_csv> true

In the above commands

<path_to_train> directory has to sub directories, <path_to_train>/query/ and <path_to_train>/target/. Both these subdirectories have images of sudoku boards made of handwritten digits. Solution of the board <path_to_train>/query/n.png should be <path_to_train>/target/n.png where n is the number of the board (eg 0.png, 1.png ......)
<path_to_test_query> has unsolved visual boards just like in <path_to_train>/query/ that will be solved after model is trained (for testing purposes)
<path_to_sample_imgs> is a numpy file (.npy) of shape (10,784) having one labelled image of each class (digit)
<path_to_out_csv> is where the result of solving the unsolved sudoku boards present in <path_to_test_query> will be stored in symbolic form (in the form of digits).

Rough notes

Recurrent Relational Network

RRN (as per paper with slight modification) (reference)

Joint Training for solving visual sudoku

SatNet : idea of training classifier and rrn together (but this was possible as they had symbolic ground truth sudoku output tables)
Optimize loss function of RRN with two more loss functions on (same or tied weights) classifier as penalty terms
we may use here cGAN with batchnorm or bit more "richer" architecture (wrt paper) .. ? (RRN is already modified)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Joint_training		Joint_training
RRN		RRN
clustering		clustering
notebooks		notebooks
results		results
useful_scripts		useful_scripts
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
run_solver.sh		run_solver.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Visual Sudoku Solver

Task

Running the solver

Without Joint training

With Joint training

Rough notes

Recurrent Relational Network

Joint Training for solving visual sudoku

About

Releases

Packages

Languages

License

HarmanDotpy/Visual-Sudoku-Solver-Deep-Learning

Folders and files

Latest commit

History

Repository files navigation

Visual Sudoku Solver

Task

Running the solver

Without Joint training

With Joint training

Rough notes

Recurrent Relational Network

Joint Training for solving visual sudoku

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages