Scan and Solve Sudoku Puzzles
This is a toy project for educational purpose. I usually use this project to explore some JVM/Kotlin libs, new Gradle features/plugins, AI libs and CI pipes (using github actions).
For a more detailed explanation of how the project works, check out the Project Blog.
The main objective of this project is to use an image of a sudoku puzzle as input, identify the puzzle, recognize the numbers, solve the problem and plot the solution back to the original image.
That been said, the project can be divided in concepts:
- Parser/Plotter: computer vision components responsible the read/write information from/to an image;
- Recognizer: responsible to recognize the number images and translate it into an integer;
- Solver: responsible to effectively solve the sudoku puzzle;
- Engine: a pipe that glues all above components together to achieve the main objective of the project.
The project was broken into the following modules (using java SPI):
- sudoscan-api: module containing domain classes, spi for solvers and recognizers and main engine.
- sudoscan-solver-choco: module containing a spi implementation using choco solver (a CSP solver) to solve to find the sudoku solution;
- sudoscan-solver-ojalgo: module containing a spi implementation using ojAlgo (a MIP solver) to solve to find the sudoku solution;
- sudoscan-recognizer-djl: module containing a spi implementation using djl (a deep learning framework) to recognize numeric images;
- sudoscan-recognizer-dl4j: module containing a spi implementation using dl4j (a deep learning framework) to recognize numeric images (this is the default recognizer);
- sudoscan-cli: cli application using other modules to solve sudoku problems using a webcam as user interface.
Both solvers (choco-solver and ojalgo) are implementations that use different discrete optimizations approaches. To learn more about MIP and CSP approaches on Sudoku problems, take a look at the following Kaggle Notebook.
Both recognizers (dlf4 and djl) are implementations that use a pre-trained model. The model creation and training can be found on the following Kaggle Notebook.
Add the repository
maven {
name = "SudoscanLibs"
url = uri("https://pintowar.jfrog.io/artifactory/sudoscan-libs-release")
}
Add api, solver and recognizer the dependencies
implementation("com.github.pintowar:sudoscan-api:x.y.z")
implementation("com.github.pintowar:sudoscan-solver-choco:x.y.z") // or sudoscan-solver-ojalgo
implementation("com.github.pintowar:sudoscan-recognizer-dl4j:x.y.z") // or sudoscan-recognizer-djl
To build the fat jar client version of the app, run the following command:
gradle -PjavacppPlatform=linux-x86_64,macosx-x86_64,windows-x86_64 clean assembleCliApp
The command above will build a fat jar containing the native dependencies of all informed platforms. To build a more optimized jar, just inform the desired platform, for instance:
gradle -PjavacppPlatform=linux-x86_64 clean assembleCliApp
This second command will build a smaller jar, but it will run only on linux-x86_64 platforms.
It is also possible to chose in compile time, which solver/recognizer module to use. The commands above (by default) will generate a jar using sudoscan-solver-choco and sudoscan-recognizer-dl4j as the main solver and recognizer. To use sudoscan-solver-ojalgo and sudoscan-recognizer-djl as solver and recognizer components, run the following command:
gradle -Pojalgo -Pdjl -PjavacppPlatform=macosx-x86_64 clean assembleCliApp
It is also possible to generate a native image of the sudoscan-cli, using a special JVM called the GraalVM.
GraalVM provides this mechanism that helps to create a native image of the application which can be executed as a standalone executable. The build process builds the executable with all the required dependencies such that there is no need the JVM to run the application.
The latest GraalVM version tested was GraalVM CE 21.2.0 (graalvm-21.2.0+java11) which is for Java 11.
To build the native image, run the following command on sudoscan-cli
module:
gradle -PjavacppPlatform=linux-x86_64 clean nativeCompile