Solving the Gambler's Problem using Value Iteration from the book Reinforcement Learning (2nd Edition) by Richard S. Sutton and Andre G. Barto
-
Updated
Nov 29, 2019 - Python
Solving the Gambler's Problem using Value Iteration from the book Reinforcement Learning (2nd Edition) by Richard S. Sutton and Andre G. Barto
multi-armed bandit, gambler problem, cliff problem and TD learning
python code successfullly reproduce the Gambler problem, Figure 4.6 of Chapter 4 on Sutton's book, Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction (Vol. 1, No. 1). Cambridge: MIT press.
Gambler's problem environment implemented OpenAI gym-style
A dynamic programming solution to the classic gambler's problem introduced in Sutton and Barton's RL book
A simulation of the gambler's ruin problem: the traditional set-up with 2 players, as well as a general solution in n players.
Add a description, image, and links to the gambler-problem topic page so that developers can more easily learn about it.
To associate your repository with the gambler-problem topic, visit your repo's landing page and select "manage topics."