- This code contains official implementation codes of PALR, which is used to produce experimental results in the original paper presented in NeurIPS 2023 (pdf).
- This code contains PALR and its baseline 4 methods (BC, FCA, MINE, RAP).
- To run this code, first install the anaconda virtual environment and install D4RL:
conda env create -f environment.yml
conda activate palr
pip install d4rl
- (optional) Download D4RL dataset:
python download_d4rl_dataset.py
- Train imitation policies using
main.py
. For the ease of hyperparameter search,pid
pass into main code andpid
-th configuration of the overall grid will be executed. Note that the default setting is:
methodlist = ['BC', 'RAP', 'FCA', 'MINE', 'PALR']
envlist = ['Hopper', 'Walker2d', 'HalfCheetah', 'Ant']
stacksizelist = [2, 4]
seedlist = [0, 1, 2, 3, 4]
To execute 0-th configuration, i.e. method='BC', env='Hopper', stacksize=2, seed=0
, run:
python train.py --pid=0
- Our code implementation is based on the following public repositories:
- rlkit : https://github.com/rail-berkeley/rlkit
- FCA : https://github.com/AlvinWen428/fighting-copycat-agents.git
- MINE implementation : https://github.com/mohith-sakthivel/mine-pytorch.git
- HSCIC implementation : https://github.com/namratadeka/circe.git