This repo contains self-contained RL implementations, including:
- Basic Plot Usage
- Basic OpenAI Gym Usage
- Value Iteration
- Policy Iteration
- Monte Carlo Learning
- SARSA
- Q Learning
- DQN
- Generalized Advantage Estimate
The objective is to make a custom MuJoCo environment for training an agent (Snapbot
) with RL.