This repository contains code for the paper Performative Reinforcement Learning.
The repository is structured as follows:
src/
: This folder contains all the source code files required for generating the experiments' data and figures.

data/
: This folder is where all the data will be generated.

figures/
: This folder is where all the figures will be generated.

limiting_envs/
: This folder is for storing visualizations of the environment.
Before running the scripts, please make sure the following prerequisites are installed.
Python3
matplotlib
numpy
copy
itertools
time
cvxpy
cvxopt
click
multiprocessing
statistics
json
contextlib
joblib
tqdm
os
cmath
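Most of the modules listed above (copy, itertools, time, multiprocessing, statistics, json, contextlib, os, cmath) are part of the Python standard library and ship with Python 3, so only the third-party packages need to be installed separately. Assuming pip is available, a command along the following lines should cover them:

pip install matplotlib numpy cvxpy cvxopt click joblib tqdm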
To recreate the results of our paper, run run_experiment.py with the following options. Each of these commands implements one of the methods described in the paper.
python run_experiment.py --fbeta=10
python run_experiment.py --gradient
python run_experiment.py --sampling
python run_experiment.py --gradient --sampling --etas 1
python run_experiment.py --policy_gradient
python run_experiment.py --sampling --lagrangian
After running the above commands, the plots will be generated in the figures/ directory.
The following output is not included in the paper:
- For the repeated gradient ascent with finite samples experiment, the corresponding suboptimality gap is also computed.
For any questions or comments, contact strianta@mpi-sws.org.