Ergodic reinforcement learning

This repository contains an algorithm for transforming time-series data into time-series with ergodic increments. As an example application, it contains an implementation of a game with non-ergodic dynamics and a demo where reinforcement learning agents try to learn optimal policies for this game with transformed and untransformed reward time-series. A more detailed description of the method can be found in

Dominik Baumann, Erfaun Noorani, James Price, Ole Peters, Colm Connaughton, and Thomas B. Schön, "Reinforcement learning with non-ergodic reward increments: robustness via ergodicity transformations," Transactions on Machine Learning Research, 2025, arXiv.

Requirements

The code was developed using Python 3.9 and depends on the following libraries:

gym 0.21.0
numpy 1.24.2
scipy 1.10.1
stable-baselines3 1.8.0
statsmodels 0.13.5
for plotting: matplotlib 3.7.1

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
coin_toss.py		coin_toss.py
demo_coin_toss.ipynb		demo_coin_toss.ipynb
transform.py		transform.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ergodic reinforcement learning

Requirements

About

Releases

Packages

Languages

License

baumanndominik/ergodic_rl

Folders and files

Latest commit

History

Repository files navigation

Ergodic reinforcement learning

Requirements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages