A reinforcement learning framework for the game of Nim.
reinforcement-learning
q-learning
dqn
sarsa
dynamic-programming
policy-iteration
value-iteration
expected-sarsa
monte-carlo-methods
double-q-learning
temporal-difference-learning
double-sarsa
double-expected-sarsa
n-step-bootstrapping
n-step-sarsa
n-step-expected-sarsa
off-policy-n-step-sarsa
off-policy-n-step-expected-sarsa
n-step-tree-backup
Updated Nov 21, 2020 - C++