Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 732 Bytes

README.md

File metadata and controls

12 lines (9 loc) · 732 Bytes

ReinforcementLearning

Implementations of standard RL problems and algorithms

  1. Monte Carlo Learning Off-policy every-visit and off-policy every-visit with Importance Sampling

  2. Dynamic Programming

    1. Value Iteration Value Iteration algorithm tested on Gambler's problem and Frozen Lake environment
  3. TD learning Implement three TD learning control algorithms SARSA, Expected SARSA and Q-Learning