Move reward function out of simulator #266
|
Made this a draft until the pipeline checks succeed.
|
|
I ran ruff; there is a commit for it. (Update: I ran the wrong ruff.) The tests are failing on master as well. (They pass for me now, so I may have run something wrong. I will check the failing tests.) |
|
I'll take a look. |
|
Tests are now passing on main using the latest mal-toolbox. Try rebasing and you should see which tests started failing in this PR. |
|
This PR is also outdated since I realized the reward functions were not behaving as I expected. |
This moves the reward function out of the simulator to avoid accidental leakage of state.
It should also make it easier to define custom reward functions.
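
To illustrate the idea, here is a minimal sketch of a reward function that lives outside the simulator and is passed in as a plain callable. It is not the code from this PR: `StepResult`, `RewardFn`, `sum_of_node_rewards`, `negated`, and this toy `Simulator` are hypothetical names made up for the example, and the real simulator's state and API will differ.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Iterable, Mapping


@dataclass(frozen=True)
class StepResult:
    """Hypothetical immutable snapshot of what an agent reached in one step."""
    newly_reached: FrozenSet[str]


# A reward function is a pure function of the step snapshot and a reward table.
# It keeps no state of its own, so nothing can leak between steps or episodes.
RewardFn = Callable[[StepResult, Mapping[str, float]], float]


def sum_of_node_rewards(step: StepResult, node_rewards: Mapping[str, float]) -> float:
    """Sum the rewards of the nodes reached in this step only."""
    return sum(node_rewards.get(node, 0.0) for node in step.newly_reached)


def negated(fn: RewardFn) -> RewardFn:
    """Build a custom reward function by wrapping another one."""
    def wrapped(step: StepResult, node_rewards: Mapping[str, float]) -> float:
        return -fn(step, node_rewards)
    return wrapped


class Simulator:
    """Toy stand-in for a simulator (assumption, not the real class)."""

    def __init__(
        self,
        node_rewards: Mapping[str, float],
        reward_fn: RewardFn = sum_of_node_rewards,
    ) -> None:
        self.node_rewards = dict(node_rewards)
        self.reward_fn = reward_fn

    def step(self, newly_reached: Iterable[str]) -> float:
        # The simulator only builds the snapshot; the reward logic is external.
        result = StepResult(frozenset(newly_reached))
        return self.reward_fn(result, self.node_rewards)
```

Under these assumptions, swapping in a custom reward is a one-argument change, e.g. `Simulator(rewards, reward_fn=negated(sum_of_node_rewards))`, and each call to `step` depends only on its own input.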