Bipedal Walker using DQN
-
Updated
Jun 26, 2024 - Python
Bipedal Walker using DQN
Teaching an bipedal bot how to walk using a TD3 algorithm (variant of Reinforcement Learning - Actor & Critic method)
This project implements agent training using the Proximal Policy Optimization (PPO) algorithm in the BipedalWalker-v3 environment at two difficulty levels: normal and hardcore. The model's performance is evaluated based on rewards collected during the training process.
Reinforcement Learning and Deeep reinforcement Learning
PPO for Bipedal Walker
Add a description, image, and links to the bipedalwalker-v3 topic page so that developers can more easily learn about it.
To associate your repository with the bipedalwalker-v3 topic, visit your repo's landing page and select "manage topics."