In this project, we will work with the Crawler environment.
In this environment, there is a creature with 4 arms and 4 four arms.
Goal: The agents must move its body toward the goal direction without falling.
- CrawlerStaticTarget - Goal direction is always forward.
- CrawlerDynamicTarget- Goal direction is randomized.
Agent Reward Function (independent):
- 0.03 times body velocity in the goal direction.
- 0.01 times body direction alignment with goal direction.
The Observation space consists of 117 variables corresponding to position, rotation, velocity, and angular velocities of each limb plus the acceleration and angular acceleration of the body. Vector Action space: (Continuous) Size of 20, corresponding to target rotations for joints.
The version of environment in this project contains 12 identical agents, each with its own copy of the environment.
For this project, we can download it from one of the links below. You need only select the environment that matches the operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Then, place the file in the Crawler_using_PPO/data/
folder, and unzip (or decompress) the file.
This repo is built in Ubuntu, please change the environment file if your OS is different.
To install required packages, run pip install -r src/requirements.txt
in terminal.
To test the existing agent, please run python test.py
To train your own agent, please run python train.py