Solving for TickTackToe with Payoff Maximization and self play in C++11
The software plays tick tack toe against itself and rewards every action on an information set that leads to a victory. After some time of self play it can recall from experience which action to choose in which situations to win the game. The PayoffMaximization is general and could be used for many different games and even other problems.