Official codebase for the paper Curricular Subgoal for Inverse Reinforcement Learning.
TL;DR: Our main contribution is a dedicated curricular subgoal-based IRL framework that enables multi-stage imitation from expert demonstrations. Extensive experiments on the D4RL and autonomous driving benchmarks show that the proposed CSIRL framework yields significantly better performance than state-of-the-art competitors, as well as better interpretability of the training process. Moreover, the robustness analysis shows that CSIRL maintains high performance even with only one expert trajectory.
Abstract: Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated remarkable success in imitation learning. To promote expert-like behavior, existing IRL methods mainly focus on learning global reward functions that minimize the trajectory difference between the imitator and the expert. However, these global designs still suffer from redundant noise and error propagation, leading to unsuitable reward assignment and thus degrading the agent's capability in complex multi-stage tasks. In this paper, we propose a novel Curricular Subgoal-based Inverse Reinforcement Learning (CSIRL) framework that explicitly disentangles one task into several local subgoals to guide agent imitation. Specifically, CSIRL first introduces the decision uncertainty of the trained agent over expert trajectories to dynamically select subgoals, which directly determine the exploration boundary of different task stages. To further acquire local reward functions for each stage, we customize a meta-imitation objective based on these curricular subgoals to train an intrinsic reward generator. Experiments on the D4RL and autonomous driving benchmarks demonstrate that the proposed method yields results superior to state-of-the-art counterparts, as well as better interpretability.
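To make the subgoal-selection idea more concrete, here is a minimal sketch (not the authors' implementation) of how decision uncertainty over an expert trajectory could pick the next subgoal, using ensemble disagreement of Q-networks as the uncertainty proxy; the names select_subgoal, q_ensemble, and threshold are hypothetical.

# Illustrative sketch only: pick a curricular subgoal as the first expert state
# where the agent's decision uncertainty (here, ensemble disagreement) is large.
import numpy as np

def select_subgoal(expert_states, expert_actions, q_ensemble, threshold=0.5):
    # expert_states: [T, state_dim], expert_actions: [T, action_dim] along one trajectory.
    # q_ensemble: list of callables, each mapping (states, actions) -> [T] Q-value estimates.
    # Stack Q-value estimates from every ensemble member: shape [n_members, T].
    q_values = np.stack([q(expert_states, expert_actions) for q in q_ensemble], axis=0)
    # Standard deviation across members as a simple per-step uncertainty measure.
    uncertainty = q_values.std(axis=0)
    uncertain_steps = np.nonzero(uncertainty > threshold)[0]
    # The first overly uncertain step bounds the current stage; otherwise keep the final state.
    return int(uncertain_steps[0]) if len(uncertain_steps) > 0 else len(expert_states) - 1

The returned index would bound exploration for the current stage; as the agent becomes confident up to that point, the subgoal can move further along the trajectory, which is the curricular behavior described above.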
See the requirements.txt file for more information about how to install the dependencies.
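If you just want to install everything listed there, the usual command is:
pip install -r requirements.txt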
Note that we made some modifications to the original highway-env to better reflect real driving environments. The modified highway-env is provided in highway_modify and can be installed by running:
cd highway_modify
pip install -e .
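To confirm that the editable install is the one being picked up (assuming the package keeps the standard import name highway_env), you can print where it is loaded from:
python -c "import highway_env; print(highway_env.__file__)"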
Detailed instructions to replicate the results in the paper are contained in the scripts directory. Here we give the general form of the commands.
# highway-fast
python main.py env=highway-fast-continues-v0_s35_d1 expert.tra=<EXPERT_DATASET_PATH> seed=<RANDOM_SEED>
# merge
python main.py env=merge-continues-v0 expert.tra=<EXPERT_DATASET_PATH> seed=<RANDOM_SEED>
# roundabout
python main.py env=roundabout-continues-v1 expert.tra=<EXPERT_DATASET_PATH> seed=<RANDOM_SEED>
# intersection
python main.py env=intersection-continues-v0-o1 expert.tra=<EXPERT_DATASET_PATH> seed=<RANDOM_SEED>
Make sure to replace <EXPERT_DATASET_PATH> with the path to the corresponding dataset in the expert_data directory.
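For example, a filled-in command for the merge scenario might look like the following; the dataset filename merge.pkl is purely illustrative, so substitute the actual file shipped in expert_data:
python main.py env=merge-continues-v0 expert.tra=expert_data/merge.pkl seed=0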
If you find this work useful for your research, please cite our paper:
@article{liu2023CSIRL,
  title={Curricular Subgoal for Inverse Reinforcement Learning},
  author={Liu, Shunyu and Qing, Yunpeng and Xu, Shuqi and Wu, Hongyan and Zhang, Jiangtao and Cong, Jingyuan and Liu, Yunfu and Song, Mingli},
  journal={arXiv preprint arXiv:2306.08232},
  year={2023}
}
Please feel free to contact us via email (liushunyu@zju.edu.cn, qingyunpeng@zju.edu.cn) if you are interested in our research :)