GPUDrive

An extremely fast, data-driven driving simulator written in C++.

Highlights

⚡️ Fast simulation for agent development and evaluation at 1 million FPS through the Madrona engine.
🐍 Provides Python bindings and gymnasium wrappers in torch and jax.
🏃‍➡️ Compatible with the Waymo Open Motion Dataset, featuring over 100K scenarios with human demonstrations.
📜 Readily available PPO implementations via SB3 and CleanRL / Pufferlib.
👀 Easily configure the simulator and agent views.
🎨 Diverse agent types: Vehicles, cyclists and pedestrians.

Simulator state	Agent observation

For details, see our paper and the introduction tutorials, which guide you through the basic usage.

Installation

To build GPUDrive, ensure you have all the required dependencies listed here including CMake, Python, and the CUDA Toolkit. See the details below.

Dependencies

CMake >= 3.24
Python >= 3.11
CUDA Toolkit >= 12.2 and <= 12.4 (We do not support CUDA versions 12.5+ at this time. Verify your CUDA version using nvcc --version.)
On macOS and Windows, install the required dependencies for XCode and Visual Studio C++ tools, respectively.

After installing the necessary dependencies, clone the repository (don't forget the --recursive flag!):

git clone --recursive https://github.com/Emerge-Lab/gpudrive.git
cd gpudrive

Then, there are two options for building the simulator:

🔧 Option 1. Manual install

For Linux and macOS, use the following commands:

mkdir build
cd build
cmake .. -DCMAKE_BUILD_TYPE=Release
make -j # cores to build with, e.g. 32
cd ..

For Windows, open the cloned repository in Visual Studio and build the project using the integrated cmake functionality.

Next, set up a Python environment

With uv (Recommended)

Create a virtual environment and install the Python components of the repository:

uv sync --frozen

With pyenv

Create a virtual environment:

pyenv virtualenv 3.11 gpudrive
pyenv activate gpudrive

Set it for the current project directory (optional):

pyenv local gpudrive

With conda

conda env create -f ./environment.yml
conda activate gpudrive

Install Python package

Finally, install the Python components of the repository using pip (this step is not required for the uv installation):

# macOS and Linux.
pip install -e .

Dependency-groups include pufferlib, sb3, vbd, and tests.

# On Windows.
pip install -e . -Cpackages.madrona_escape_room.ext-out-dir=<PATH_TO_YOUR_BUILD_DIR on Windows>

🐳 Option 2. Docker

To get started quickly, we provide a Dockerfile in the root directory.

Prerequisites

Ensure you have the following installed:

Building the Docker mage

Once installed, you can build the container with:

DOCKER_BUILDKIT=1 docker build --build-arg USE_CUDA=true --tag gpudrive:latest --progress=plain .

Running the Container

To run the container with GPU support and shared memory:

docker run --gpus all -it --rm --shm-size=20G -v ${PWD}:/workspace gpudrive:latest /bin/bash

Test whether the installation was successful by importing the simulator:

import madrona_gpudrive

To avoid compiling on GPU mode everytime, the following environment variable can be set with any custom path. For example, you can store the compiled program in a cache called gpudrive_cache:

export MADRONA_MWGPU_KERNEL_CACHE=./gpudrive_cache

Please remember that if you make any changes in C++, you need to delete the cache and recompile.

Optional: If you want to use the Madrona viewer in C++

Extra dependencies to use Madrona viewer

To build the simulator with visualization support on Linux (build/viewer), you will need to install X11 and OpenGL development libraries. Equivalent dependencies are already installed by Xcode on macOS. For example, on Ubuntu:

  sudo apt install libx11-dev libxrandr-dev libxinerama-dev libxcursor-dev libxi-dev mesa-common-dev libc++1

Integrations

What	Info	Run	Training SPS
IPPO implementation SB3	IPPO, PufferLib, Implementation	`python baselines/ppo/ppo_sb3.py`	25 - 50K
IPPO implementation PufferLib 🐡	PPO	`python baselines/ppo/ppo_pufferlib.py`	100 - 300K

Getting started

To get started, see these entry points:

Our intro tutorials. These tutorials take approximately 30-60 minutes to complete and will guide you through the dataset, simulator, and how to populate the simulator with different types of actors.
The environment docs provide detailed info on environment settings and supported features.

Pre-trained policies

Several pre-trained policies are available via the PyTorchModelHubMixin class on 🤗 huggingface_hub.

Best Policy (10,000 Scenarios). The best policy from Building reliable sim driving agents by scaling self-play is available here here. This policy was trained on 10,000 randomly sampled scenarios from the WOMD training dataset.
Alternative Policy (1,000 Scenarios). A policy trained on 1,000 scenarios can be found here

Note: These models were trained with the environment configurations defined in examples/experimental/config/reliable_agents_params.yaml, changing environment/observation configurations will affect performance.

Usage

To load a pre-trained policy, use the following:

from gpudrive.networks.late_fusion import NeuralNet

# Load pre-trained model via huggingface_hub
agent = NeuralNet.from_pretrained("daphne-cornelisse/policy_S10_000_02_27")

See tutorial 04 for all the details.

Dataset

Download the dataset

Two versions of the dataset are available, a mini version with a 1000 training files and 300 test/validation files, and a large dataset with 100k unique scenes.
Replace 'GPUDrive_mini' with 'GPUDrive' below if you wish to download the full dataset.

Download the dataset

To download the dataset you need the huggingface_hub library

pip install huggingface_hub

Then you can download the dataset using python or just huggingface-cli.

Option 1: Using Python

>>> from huggingface_hub import snapshot_download
>>> snapshot_download(repo_id="EMERGE-lab/GPUDrive_mini", repo_type="dataset", local_dir="data/processed")

Option 2: Use the huggingface-cli

Log in to your Hugging Face account:

huggingface-cli login

Download the dataset:

huggingface-cli download EMERGE-lab/GPUDrive_mini --local-dir data/processed --repo-type "dataset"

Option 3: Manual Download

Visit https://huggingface.co/datasets/EMERGE-lab/GPUDrive_mini
Navigate to the Files and versions tab.
Download the desired files/directories.

NOTE: If you downloaded the full-sized dataset, it is grouped to subdirectories of 10k files each (according to hugging face constraints). In order for the path to work with GPUDrive, you need to run

python data_utils/post_processing.py #use --help if you've used a custom download path

Re-build the dataset

If you wish to manually generate the dataset, GPUDrive is compatible with the complete Waymo Open Motion Dataset, which contains well over 100,000 scenarios. To download new files and create scenarios for the simulator, follow the steps below.

Re-build the dataset in 3 steps

First, head to https://waymo.com/open/ and click on the "download" button a the top. After registering, click on the files from v1.2.1 March 2024, the newest version of the dataset at the time of wrting (10/2024). This will lead you a Google Cloud page. From here, you should see a folder structure like this:

waymo_open_dataset_motion_v_1_2_1/
│
├── uncompressed/
│   ├── lidar_and_camera/
│   ├── scenario/
│   │   ├── testing_interactive/
│   │   ├── testing/
│   │   ├── training_20s/
│   │   ├── training/
│   │   ├── validation_interactive/
│   │   └── validation/
│   └── tf_example/

Now, download files from testing, training and/or validation in the scenario folder. An easy way to do this is through gsutil. First register using:

gcloud auth login

...then run the command below to download the dataset you prefer. For example, to download the validation dataset:

gsutil -m cp -r gs://waymo_open_dataset_motion_v_1_2_1/uncompressed/scenario/validation/ data/raw

where data/raw is your local storage folder. Note that this can take a while, depending on the size of the dataset you're downloading.

The last thing we need to do is convert the raw data to a format that is compatible with the simulator using:

python data_utils/process_waymo_files.py '<raw-data-path>' '<storage-path>' '<dataset>'

Note: Due to an open issue, installation of waymo-open-dataset-tf-2.12.0 fails for Python 3.11. To use the script, in a separate Python 3.10 environment, run

pip install waymo-open-dataset-tf-2-12-0 trimesh[easy] python-fcl

Then for example, if you want to process the validation data, run:

python data_utils/process_waymo_files.py 'data/raw/' 'data/processed/' 'validation'
>>>
Processing Waymo files: 100%|████████████████████████████████████████████████████████████████| 150/150 [00:05<00:00, 28.18it/s]
INFO:root:Done!

and that's it!

🧐 Caveat: A single Waymo tfrecord file contains approximately 500 traffic scenarios. Processing speed is about 250 scenes/min on a 16 core CPU. Trying to process the entire validation set for example (150 tfrecords) is a LOT of time.

Post-processing

Running python data_utils/postprocessing.py filters out corrupted files and undoes hugging face directory grouping.

📜 Citing GPUDrive

If you use GPUDrive in your research, please cite our ICLR 2025 paper

@inproceedings{kazemkhani2025gpudrive,
      title={GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS},
      author={Saman Kazemkhani and Aarav Pandya and Daphne Cornelisse and Brennan Shacklett and Eugene Vinitsky},
      booktitle={Proceedings of the International Conference on Learning Representations (ICLR)},
      year={2025},
      url={https://arxiv.org/abs/2408.01584},
      eprint={2408.01584},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
}

Contributing

If you encounter a bug, notice a missing feature, or want to contribute, feel free to create an issue or reach out! We'd be excited to have you involved in the project.

Name		Name	Last commit message	Last commit date
Latest commit History 644 Commits
.github/workflows		.github/workflows
assets		assets
baselines		baselines
data		data
data_utils		data_utils
examples		examples
external		external
gpudrive		gpudrive
src		src
tests		tests
.dockerignore		.dockerignore
.env.template		.env.template
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CMakeLists.txt		CMakeLists.txt
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build_gpudrive.py		build_gpudrive.py
conftest.py		conftest.py
environment.yml		environment.yml
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.py		setup.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GPUDrive

Highlights

Installation

With uv (Recommended)

With pyenv

With conda

Install Python package

Prerequisites

Building the Docker mage

Running the Container

Extra dependencies to use Madrona viewer

Integrations

Getting started

Pre-trained policies

Usage

Dataset

Download the dataset

Re-build the dataset

Post-processing

📜 Citing GPUDrive

Contributing

About

Uh oh!

Releases 4

Packages

Uh oh!

Uh oh!

Contributors 14

Languages

License

Emerge-Lab/gpudrive

Folders and files

Latest commit

History

Repository files navigation

GPUDrive

Highlights

Installation

With uv (Recommended)

With pyenv

With conda

Install Python package

Prerequisites

Building the Docker mage

Running the Container

Extra dependencies to use Madrona viewer

Integrations

Getting started

Pre-trained policies

Usage

Dataset

Download the dataset

Re-build the dataset

Post-processing

📜 Citing GPUDrive

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Uh oh!

Contributors 14

Languages

Packages