Constrained Intrinsic Motivation for Reinforcement Learning

Constrained Intrinsic Motivation for Reinforcement Learning (IJCAI 2024) [Paper]
Xiang Zheng, Xingjun Ma, Chao Shen, Cong Wang

Abstract

This paper investigates two fundamental problems that arise when utilizing Intrinsic Motivation (IM) for reinforcement learning in Reward-Free Pre-Training (RFPT) tasks and Exploration with Intrinsic Motivation (EIM) tasks: 1) how to design an effective intrinsic objective in RFPT tasks, and 2) how to reduce the bias introduced by the intrinsic objective in EIM tasks. Existing IM methods suffer from static skills, limited state coverage, sample inefficiency in RFPT tasks, and suboptimality in EIM tasks. To tackle these problems, we propose Constrained Intrinsic Motivation (CIM) for RFPT and EIM tasks, respectively: 1) CIM for RFPT maximizes the lower bound of the conditional state entropy subject to an alignment constraint on the state encoder network for efficient dynamic and diverse skill discovery and state coverage maximization; 2) CIM for EIM leverages constrained policy optimization to adaptively adjust the coefficient of the intrinsic objective to mitigate the distraction from the intrinsic objective. In various MuJoCo robotics environments, we empirically show that CIM for RFPT greatly surpasses fifteen IM methods for unsupervised skill discovery in terms of skill diversity, state coverage, and fine-tuning performance. Additionally, we showcase the effectiveness of CIM for EIM in redeeming intrinsic rewards when task rewards are exposed from the beginning.

Environment

conda env create -f environment.yml -n cim
conda activate cim

Run

## Test vanilla PPO
python src/train.py -m task_type=gym task=Ant-v4 tl=200 method=base p.rf_rate=0

## Run CIM for unsupervised skill discovery
python src/train.py -m task_type=gym task=Ant-v4 tl=200 method=cim p.sd=2 p.ro=1 c.tt=2a7 c.spc="512*64"

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
configs		configs
src		src
.gitignore		.gitignore
.project-root		.project-root
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Constrained Intrinsic Motivation for Reinforcement Learning

Abstract

Environment

Run

About

Releases

Packages

Languages

x-zheng16/CIM

Folders and files

Latest commit

History

Repository files navigation

Constrained Intrinsic Motivation for Reinforcement Learning

Abstract

Environment

Run

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages