Reinforcement-Learning-Problems

In this repository I try to solve small Reinforcement Learning problems using NumPy. Current problems solved:

Gridworld (State Value Estimation Problem)
Gambler's Problem (State Value Estimation Problem)
Windy Gridworld (Control Problem)
K-Bandits (Control Problem)