Rename safe_policy_checkpoints -> base_policy_checkpoints
cassidylaidlaw committed Nov 11, 2024
1 parent a3f6ed7 commit 95cd1fa
Showing 31 changed files with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -12,7 +12,7 @@ All Python code is under the `occupancy_measures` package. Run
to install dependencies.

## Training the ORPO policies
-Checkpoints for the behavioral cloning (BC) trained base policies are stored within the `data/safe_policy_checkpoints` directory. For now, these checkpoints were generated in Python 3.9, but in the future, we will provide checkpoints that work with all python versions. You can use these checkpoints to train your own ORPO policies using the following commands:
+Checkpoints for the behavioral cloning (BC) trained base policies are stored within the `data/base_policy_checkpoints` directory. For now, these checkpoints were generated in Python 3.9, but in the future, we will provide checkpoints that work with all python versions. You can use these checkpoints to train your own ORPO policies using the following commands:

- state-action occupancy measure regularization:
```