Skip to content

Latest commit

 

History

History
23 lines (17 loc) · 1.07 KB

README.md

File metadata and controls

23 lines (17 loc) · 1.07 KB

CodeSize Repo LastCommint

Policy Gradient Algorithms

  • VPG (VANILLA POLICY GRADIENT)
  • PPO (PROXIMAL POLICY OPTIMIZATION)
  • TRPO (TRUST REGION POLICY OPTIMIZATION)

Installation

pip install matplotlib gym==0.25.2 tensorflow keras-rl2 pyglet protobuf==3.20.*

Training results

graph_500ep_