Skip to content

Project for Reinforcement Learning course in master degree Data Science and Scientific Computing - Units.

Notifications You must be signed in to change notification settings

Cristian-Curaba/Solving-Gymnasium-toy-games-with-Reinforcement-Learning-methods

Repository files navigation

Project for reinforcement learning course at "Data Science and Scientific Computing" master degree in Units.

Goal: solving toy games from Gymnasium (openAI) package with algorithms implemented from scratch.

Algorithms: q-learning, sarsa, expected sarsa, constant rate Monte Carlo.

Toy games: Blackjack, Cliff Walking, Frozen Lake, Taxi.

Contents: Expected_SARSA.ipynb, Constant_Rate_Monte_Carlo, Q_learning.ipynb, SARSA.ipynb contains the realative algorithm with everything needed to test them in each environment.

Everything.ipynb, Analysis.ipynb are auxiliar Jupyter Notebook to make a grid tuning and confront the solutions.

Best_*game_name* folders contain the tuned algorithm for solving each game.

others_*game_name* folders contain all the discarded plots from the tuning grid process.

policy_*game_name* folders contain gifs (20 images) to show the policy learned by the two best algorithms for each game (they are always q-learning and sarsa). policy_blackjack folder contains also grid plots to better understand differences between policies.

Report.pdf is the presentation for the exam (use Adobe Reader to visualize gifs).

About

Project for Reinforcement Learning course in master degree Data Science and Scientific Computing - Units.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published