Interactive Learning Course | Homework & Quizzes | Fall 2021 | Prof. Majid Nili
Topics: q-learning, epsilon-greedy, sarsa, value-iteration, tree-backup, n-armed-bandit-problem, regret-minimization, multi-agent-multi-armed-bandits, 2-step-tree-backup, model-based-learning, off-policy-monte-carlo, social-bandit-learning, reinforcement-comparison, model-based-model-free-mixture
Updated Feb 24, 2022 - Jupyter Notebook