Visualization Tool based on Gridworld. It supports Dynamic Programming algorithms (Value Iterarion and Policy Iteration), Temporal Difference algorithms (Q-Learning and SARSA), and customizable grid, agent and learning aspects.
This repository holds the source code of this project (made with Unity) and also my final report (in portuguese).
You can also access the web version.