tinygym

A tiny RL playground with minimal, hackable implementations of common RL algorithms.

python tinygym.py --algo [algo] --task [task] --max_evals [default=1000] --save [True]

Test on sample tasks: python unit_test.py

RL

Converges to basic controls tasks in <1K episodes (CMA takes longer, ~10K).