tabularRL

Implementation of tabular methods in reinforcement learning for policy control and evaluation.

We demonstrate different policy evaluation (Monte Carlo, k step Temporal Difference) and control (k step SARSA, TD lambda, Monte Carlo) algorithms for the game of BlackJack.

Code

Environment

blackjack_simulator.py is an implementation of BlackJack simulator. It also defines an abstract class for Environment. Any suitable class derived from this abstract class can be substituted in place of BlackJack to run the RL algorithms for a different setting.

RL Algorithms

evaluate_policy.py contains implementation for different policy evaluation algorithms while control_policy.py contains policy control algorithms. It also contains test_policy method to evaluate any policy by computing average rewards by acting out in the environment according to the given policy.

Demo code

tabular_rl.py implements a wrapper over the implemented algorithms providing a clean interface for benchmarking the algorithms. For a more interactive guide and visualization of algorithms, and observing the effect of different hyperparameters refer to BlackJackDemo.ipynb notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
BlackJackDemo.ipynb		BlackJackDemo.ipynb
LICENSE		LICENSE
README.md		README.md
blackjack_simulator.py		blackjack_simulator.py
control_policy.py		control_policy.py
evaluate_policy.py		evaluate_policy.py
tabular_rl.py		tabular_rl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tabularRL

Code

Environment

RL Algorithms

Demo code

About

Releases

Packages

Languages

License

djin31/tabularRL

Folders and files

Latest commit

History

Repository files navigation

tabularRL

Code

Environment

RL Algorithms

Demo code

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages