Interactive Learning Course | Home Works & Quiz | Fall 2021 | Prof. Majid Nili
q-learning
epsilon-greedy
sarsa
value-iteration
tree-backup
n-armed-bandit-problem
regret-minimization
multi-agent-multi-armed-bandits
2-step-tree-backup
model-based-learning
off-policy-monte-carlo
social-bandit-learning
reinforcement-comparison
model-based-model-free-mixture
-
Updated
Feb 24, 2022 - Jupyter Notebook