This is the code for the paper "Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward".
Install the Python environment with:
conda create -n diaster python=3.9 -y
conda activate diaster
conda install pytorch cudatoolkit=11.3 -c pytorch -y
pip install -r ./requirements.txt
Run an experiment with:
python3 main.py --env-name=[Env name]
The config files act as defaults for a task. They are all located in the config/ directory.
--env-name selects one of the config files in config/, which include Hopper-v3, Walker2d-v3, Swimmer-v3, Humanoid-v3, and HumanoidStandup-v2.
All results will be stored in the result folder.
For example, run Diaster on Hopper:
python main.py --env-name=Hopper-v3
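To run every listed environment in sequence, a small driver script along the following lines may help. This is only a sketch and is not part of the repository: the file name run_all.py, the helper names build_command and run_all, and the --run switch are hypothetical. The only things taken from the instructions above are the main.py entry point, the --env-name option, and the environment names. It assumes it is launched from the repository root with the diaster environment active.

```python
import subprocess
import sys

# Environment names copied from the list above.
ENV_NAMES = [
    "Hopper-v3",
    "Walker2d-v3",
    "Swimmer-v3",
    "Humanoid-v3",
    "HumanoidStandup-v2",
]


def build_command(env_name):
    """Return the argument list for one training run (hypothetical helper)."""
    # sys.executable reuses the interpreter of the active conda environment.
    return [sys.executable, "main.py", f"--env-name={env_name}"]


def run_all(env_names, dry_run=True):
    """Print each command and, unless dry_run is set, execute it in order."""
    commands = [build_command(name) for name in env_names]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sweep at the first failing run.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # --run is a switch of this sketch, not of main.py.
    # Without it the commands are only printed.
    run_all(ENV_NAMES, dry_run="--run" not in sys.argv)
```

Saved as run_all.py, "python run_all.py" would only print the five commands, and "python run_all.py --run" would launch them one after another.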