!pip install gym
https://pytorch.org/get-started/locally/
!pip install numpy
!pip install matplotlib
- Actor Critic Final.py
- Contains the implementation of the Actor-Critic algorithm, a reinforcement learning technique combining policy (Actor) and value function (Critic) approximation to enhance learning efficiency.
- REINFORCE with Baseline Final.py:
- Encompasses the implementation of the REINFORCE algorithm with Baseline, a policy gradient method incorporating a baseline to reduce variance in gradient estimates.
- Semi-Gradient-SARSA Final.py
- Houses the implementation of the Semi-Gradient-SARSA algorithm, a temporal difference learning method applied in reinforcement learning scenarios for updating Q-values and optimizing policy.