Skip to content

Commit 6c3dcf9

Browse files
authored
Update README.md
1 parent 145db87 commit 6c3dcf9

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ This repository contains code from our paper - 'Safe Exploration using Adaptive
55
Empowering safe exploration of reinforcement learning (RL) agents during training is a critical impediment towards deploying RL agents in many real-world scenarios. Training RL agents in unknown, \textit{black-box} environments poses an even greater safety risk when prior knowledge of the domain/task is unavailable. We introduce ADVICE (Adaptive Shielding with a Contrastive Autoencoder), a novel post-shielding technique that distinguishes safe and unsafe features of state-action pairs during training, thus protecting the RL agent from executing actions that yield potentially hazardous outcomes. Our comprehensive experimental evaluation against state-of-the-art safe exploration techniques demonstrates how ADVICE can significantly reduce safety violations during training while maintaining a competitive outcome reward.
66

77
<p align="center">
8-
<img src="overview.png" alt="High-level Overview of ADVICE" width="800"/>
8+
<img src="img/overview.png" alt="High-level Overview of ADVICE" width="800"/>
99
</p>
1010
<p align="center">Fig 1. A high-level overview of ADVICE including training, inference, and Adaptive ADVICE.</p>
1111

0 commit comments

Comments
 (0)