Code for the NeurIPS 2023 paper "Supported Value Regularization for Offline Reinforcement Learning" (SVR).
The paper's results were collected with MuJoCo 2.1.0 (mujoco-py 2.1.2.14) and OpenAI Gym 0.23.1 on the D4RL datasets. Networks were trained with PyTorch 1.11.0 and Python 3.7.
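A minimal setup sketch, assuming MuJoCo 2.1.0 has already been downloaded and unpacked to ~/.mujoco/mujoco210; the package pins follow the versions listed above, and the D4RL install source is the upstream Farama repository (adjust if your environment differs):
# core Python dependencies (versions used for the paper)
pip install torch==1.11.0 gym==0.23.1 mujoco-py==2.1.2.14
# D4RL datasets, installed from source
pip install git+https://github.com/Farama-Foundation/D4RL.git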
We have uploaded pretrained behavior models to SVR_bcmodels/ to make the experiments easier to reproduce.
You can also pretrain behavior models by running:
./run_pretrain.sh
You can train SVR on D4RL datasets by running:
./run_experiments.sh
This codebase logs training runs with TensorBoard. You can view saved runs with:
tensorboard --logdir <run_dir>
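For example, if training writes its logs under a results/ directory (the directory name here is an assumption; check run_experiments.sh for the actual output path):
tensorboard --logdir results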
If you find this work useful, please consider citing:
@article{mao2023supported,
title={Supported value regularization for offline reinforcement learning},
author={Mao, Yixiu and Zhang, Hongchang and Chen, Chen and Xu, Yi and Ji, Xiangyang},
journal={Advances in Neural Information Processing Systems},
volume={36},
pages={40587--40609},
year={2023}
}