This repository has been archived by the owner on Oct 7, 2024. It is now read-only.

0.3.0 -- Cartpole bugfix and more

aslanides released this 23 Apr 13:10

· 83 commits to master since this release

Fixed an issue with cartpole_swingup's best_episode not being computed correctly due to negative rewards (beb1630). This change will affect scores for this experiment, so we bump the minor version in keeping with semantic versioning.
Fix versioning issues in colab and setup.py.
Improve test.sh coverage.
Make TAGS accessible as sweep (e.g. sweep.TAGS['exploration'] to retrieve all exploration experiments.
Add a bootstrapped DQN baseline in JAX.
Extract environments to their own package for easier direct access for debugging/development.

Assets 2

Provide feedback