In this repository I try to solve small Reinforcement Learning problems using NumPy. Current problems solved:
- Gridworld (State Value Estimation Problem)
- Gambler's Problem (State Value Estimation Problem)
- Windy Gridworld (Control Problem)
- K-Bandits (Control Problem)