This repository has been archived by the owner on Oct 7, 2024. It is now read-only.
0.3.0 -- Cartpole bugfix and more
- Fixed an issue with cartpole_swingup's
best_episode
not being computed correctly due to negative rewards (beb1630). This change will affect scores for this experiment, so we bump the minor version in keeping with semantic versioning. - Fix versioning issues in colab and setup.py.
- Improve test.sh coverage.
- Make TAGS accessible as sweep (e.g. sweep.TAGS['exploration'] to retrieve all exploration experiments.
- Add a bootstrapped DQN baseline in JAX.
- Extract environments to their own package for easier direct access for debugging/development.