Commit

Update README.md
team-daniel authored May 20, 2024
1 parent 40c9781 commit 206503a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
@@ -7,7 +7,7 @@ Empowering safe exploration of reinforcement learning (RL) agents during trainin
<p align="center">
<img src="img/overview.png" alt="High-level Overview of ADVICE" width="800"/>
</p>
-<p align="center">Fig 1. A high-level overview of ADVICE including training, inference, and Adaptive ADVICE.</p>
+<p align="center">Fig 1. A high-level overview of ADVICE including construction, execution, and adaptation.</p>

### ADVICE
ADVICE starts with collecting a dataset of state-action pairs, classified as either safe or unsafe based on the outcomes they lead to within the training environment. This dataset is then used to train the contrastive autoencoder. The training process leverages a unique loss function that helps the model learn by comparing similar (safe) and dissimilar (unsafe) pairs, enhancing its ability to identify and categorize new observations quickly. To classify unseen data, a nearest neighbours model is fit on the final embedding space.
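
For readers of this diff, the pipeline described in the ADVICE paragraph (contrastive training on safe/unsafe pairs, then a nearest neighbours model fit on the embedding space) can be illustrated with a minimal sketch. It is not taken from the ADVICE code base: the README only says the loss is "unique", so the margin-based pairwise loss below is a standard stand-in, and the names `contrastive_loss`, `knn_predict`, `train_z`, and `train_y` are hypothetical. The encoder is assumed to exist already, so the embeddings are hard-coded toy values.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(z1, z2, same_label, margin=1.0):
    """Standard margin-based pairwise contrastive loss (an assumption,
    not necessarily the loss used by ADVICE).

    Pairs with the same label are pulled together; pairs with different
    labels are pushed apart until they are at least `margin` apart.
    """
    d = euclidean(z1, z2)
    if same_label:
        return d ** 2
    return max(0.0, margin - d) ** 2


def knn_predict(embeddings, labels, query, k=3):
    """Classify `query` by majority vote among its k nearest embeddings."""
    ranked = sorted(zip(embeddings, labels),
                    key=lambda pair: euclidean(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy 2-D embeddings standing in for the encoder output on
# state-action pairs (hypothetical values, for illustration only).
train_z = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
           (2.0, 2.0), (2.1, 1.9), (1.9, 2.2)]
train_y = ["safe", "safe", "safe", "unsafe", "unsafe", "unsafe"]

if __name__ == "__main__":
    print(knn_predict(train_z, train_y, (0.05, 0.1)))
    print(knn_predict(train_z, train_y, (2.0, 2.1)))
```

In this sketch, an unseen state-action pair would first be passed through the trained encoder, and the resulting embedding would be given to `knn_predict` as `query`.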
