MORL Preference Driving

Code repository for "Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving" by Surmann et al., ECMR 2025

Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving
by Hendrik Surmann
Supervised by Jorge de Heuvel and Prof. Dr. Maren Bennewitz
University of Bonn, 2025

📌 Overview

This project extends the Stable Baselines3 TD3 implementation to develop a multi-objective reinforcement learning (MORL) framework that incorporates dynamic user preferences in autonomous driving scenarios. The implementation is based on the PD-MORL algorithm (GitHub) and Stable-Baselines3 (GitHub) and is evaluated using the CARLA simulator (v0.9.15).

The goal is to train a single policy network capable of dynamically balancing multiple driving objectives, such as:

Efficiency
Comfort
Aggressiveness
Speed

Our focus is on training a single policy network capable of realizing multiple driving styles from vision-based input. The model is evaluated in diverse urban driving scenarios to assess its ability to align driving behavior with user preferences. Advanced traffic rules are only partially considered, as the primary objective is preference-adaptive driving behavior rather than raw autonomous driving performance. While the implementation is based on the SB3 TD3 framework, we extended it to the PD-MORL algorithm (an integration of preferences into TD3 with multiple Q-values). We further adapted PD-MORL to include a non-preference dimension. See the training function for details.

🚀 Installation

The installation details can be found in install/install.txt.
The core dependencies include:

Stable-Baselines3 v2.0.0
Python 3.8.10
PyTorch
CUDA 12.2
CARLA 0.9.15
Conda (for environment management)

🔧 Basic Setup (short)

Ensure you have the required dependencies installed:

conda env create -f environment.yml
conda activate my_env

WandB might need and api-key for experiment tracking: and wandb login

📂 Project Structure

install/ → Contains installation instructions.
sb3/ → The main implementation of the extended TD3 MORL algorithm.
run/ → Stores trained networks/agents.
scenarios/ → Images/Videos of the implemented driving scenarios.
sb3/logs/ → Contains logging files and images and plots of the experiment results.

The training progress and results are saved in: run/<experiment_name>_bestPref.zip/

🎮 Running Experiments

1️⃣ Start the CARLA Server

./CarlaUE4.sh -RenderOffScreen -world-port="$ARG1" &

2️⃣ Run the MORL Agent

python td3_main.py --run="$ARG0" --client_port="$ARG1" --tm_port="$ARG2"

Where:

ARG0 → Name of the agent.
ARG1 → CARLA client port.
ARG2 → Traffic manager port.

Example:

python td3_main.py --run=Agent --client_port=2000 --tm_port=8000

🔧 Configurations

Modify the config file to adjust key settings like:

Enable visualization: SPECATE=False
Train/Evaluate model: evaluate=False
Show policy: showPolicy=False
Traing Phase:key_steps= int(1e6) # or 0

📝 Citation

@INPROCEEDINGS{surmann2025ecmr,
  author={H. Surmann and J. de Heuvel and M. Bennewitz},
  title={Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving},
  booktitle={Proc. European Conference on Mobile Robots (ECMR)},
  year={2025},
  address={Padua, Italy}
}

🔗 References

Stable Baselines3: stable-baselines3.readthedocs.io
PD-MORL Algorithm: GitHub Repository
Supplemental Paper Video on YouTube

🔧 Support

For questions or issues, please contact: 📧 Hendrik Surmann - [hendrik.surmann@uni-bonn.de]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MORL Preference Driving

📌 Overview

🚀 Installation

🔧 Basic Setup (short)

📂 Project Structure

🎮 Running Experiments

1️⃣ Start the CARLA Server

2️⃣ Run the MORL Agent

🔧 Configurations

📝 Citation

🔗 References

🔧 Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
install		install
sb3		sb3
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

MORL Preference Driving

📌 Overview

🚀 Installation

🔧 Basic Setup (short)

📂 Project Structure

🎮 Running Experiments

1️⃣ Start the CARLA Server

2️⃣ Run the MORL Agent

🔧 Configurations

📝 Citation

🔗 References

🔧 Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages