GitHub - OpenDriveLab/DriveLM: [ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

Important

🌟 Stay up to date at opendrivelab.com!

DriveLM: Driving with Graph Visual Question Answering

Autonomous Driving Challenge 2024 Driving-with-Language Leaderboard.


Highlights

🔥 We instantiate datasets (DriveLM-Data) built upon nuScenes and CARLA, and propose a VLM-based baseline approach (DriveLM-Agent) for jointly performing Graph VQA and end-to-end driving.

🏁 DriveLM serves as a main track in the CVPR 2024 Autonomous Driving Challenge. Everything you need for the challenge is HERE, including the baseline, test data, submission format, and evaluation pipeline!

News

[2025/01/08] Drive-Bench released! An in-depth analysis of what DriveLM is really benchmarking. Take a look at arXiv.
[2024/07/16] DriveLM official leaderboard reopened!
[2024/07/01] DriveLM was accepted to ECCV 2024! Congrats to the team!
[2024/06/01] Challenge ended! See the final leaderboard.
[2024/03/25] Challenge test server is online and the test questions are released. Check it out!
[2024/02/29] Challenge repo released, including the baseline, data and submission format, and evaluation pipeline. Have a look!
[2023/12/22] DriveLM-nuScenes full v1.0 and paper released.
[2023/08/25] DriveLM-nuScenes demo released.

Getting Started

To get started with DriveLM:


Current Endeavors and Future Directions

The advent of GPT-style multimodal models in real-world applications motivates the study of the role of language in driving.

The dates below reflect arXiv submission dates.

If there is any missing work, please reach out to us!

DriveLM attempts to address some of the challenges faced by the community.

Lack of data: DriveLM-Data serves as a comprehensive benchmark for driving with language.
Embodiment: GVQA provides a potential direction for embodied applications of LLMs / VLMs.
Closed-loop: DriveLM-CARLA attempts to explore closed-loop planning with language.


TODO List


DriveLM-Data

DriveLM-Data covers the Perception, Prediction, Planning, Behavior, and Motion tasks, with human-written reasoning logic connecting them. On top of DriveLM-Data, we propose the task of Graph Visual Question Answering (GVQA).
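To make the idea of question-answer pairs connected by reasoning logic more concrete, here is a minimal Python sketch of a QA graph. It is an illustration only: the `QANode` class, its field names, the helper functions, and the example questions and answers are all hypothetical and do not reflect the actual DriveLM-Data schema or annotations. It assumes Python 3.9+ for the standard-library `graphlib` module.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter


@dataclass
class QANode:
    """One question-answer pair in a hypothetical reasoning graph."""
    node_id: str
    stage: str  # e.g. "perception", "prediction", "planning"
    question: str
    answer: str
    parents: list[str] = field(default_factory=list)  # ids this QA depends on


def reasoning_order(nodes):
    """Return node ids so that every QA comes after the QAs it depends on."""
    sorter = TopologicalSorter({n.node_id: n.parents for n in nodes})
    return list(sorter.static_order())


def build_context(nodes, target_id):
    """Collect the QA pairs of all ancestors of target_id, in dependency order."""
    by_id = {n.node_id: n for n in nodes}
    needed, stack = set(), list(by_id[target_id].parents)
    while stack:
        current = stack.pop()
        if current not in needed:
            needed.add(current)
            stack.extend(by_id[current].parents)
    return [
        f"Q: {by_id[i].question} A: {by_id[i].answer}"
        for i in reasoning_order(nodes)
        if i in needed
    ]


# Invented example content, not taken from DriveLM-Data.
nodes = [
    QANode("p1", "perception", "What objects are ahead?",
           "A pedestrian at the crosswalk."),
    QANode("p2", "prediction", "What will the pedestrian do?",
           "Cross the road.", parents=["p1"]),
    QANode("p3", "planning", "What should the ego vehicle do?",
           "Slow down and yield.", parents=["p2"]),
]

order = reasoning_order(nodes)
context = build_context(nodes, "p3")
```

In this sketch, answering the planning question would use the perception and prediction QA pairs returned by `build_context` as preceding context, which is one simple way to read "reasoning logic as a connection" between stages.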

📊 Comparison and Stats

DriveLM-Data is the first language-driving dataset facilitating the full stack of driving tasks with graph-structured logical dependencies.

Links to details about GVQA task, Dataset Features, and Annotation.


License and Citation

All assets and code in this repository are under the Apache 2.0 license unless specified otherwise. The language data is under CC BY-NC-SA 4.0. Other datasets (including nuScenes) inherit their own distribution licenses. Please consider citing our paper and project if they help your research.

@article{sima2023drivelm,
  title={DriveLM: Driving with Graph Visual Question Answering},
  author={Sima, Chonghao and Renz, Katrin and Chitta, Kashyap and Chen, Li and Zhang, Hanxue and Xie, Chengen and Luo, Ping and Geiger, Andreas and Li, Hongyang},
  journal={arXiv preprint arXiv:2312.14150},
  year={2023}
}

@misc{contributors2023drivelmrepo,
  title={DriveLM: Driving with Graph Visual Question Answering},
  author={DriveLM contributors},
  howpublished={\url{https://github.com/OpenDriveLab/DriveLM}},
  year={2023}
}


Other Resources

OpenDriveLab

DriveAGI | UniAD | OpenLane-V2 | Survey on E2EAD
Survey on BEV Perception | BEVFormer | OccNet

Autonomous Vision Group

tuPlan garage | CARLA garage | Survey on E2EAD
PlanT | KING | TransFuser | NEAT
