Multi-view Consistent 3D Panoptic Scene Understanding

Xianzhu Liu¹, Xin Sun¹, Haozhe Xie², Zonglin Li¹, Ru Li¹, Shengping Zhang¹

¹Harbin Institute of Technology, Weihai, China ²Nanyang Technological University, Singapore

Changelog

[2024/12/19] The repo is created.
[2025/5/29] The test code has been released.

Installation 📥

Moreover, this repository introduces an integrated 3D Panoptic Scene Understanding Benchmark implemented in Python 3.8, PyTorch 1.12 and CUDA 11.3.

You can use the following command to install PyTorch with CUDA 11.3.

conda create -n ssc python=3.8
conda activate ssc
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch

Install dependencies:

pip install -r requirements.txt

Datasets and Pretrained Models 🛢️

For the datasets used in this paper, please refer to the download and preprocessing instructions provided in Panoptic-Lifting

You can download our pre-trained checkpoints here

Inference and Evaluation 🚩

We provide an example to use our code.

Please download the pretrained checkpoints and unzip.
Use the render_panopli.py script to render. Example:

python inference/render_panopli.py pretrained_ckpts/hypersim001008/checkpoints/hypersim001008.ckpt True

This will render the outputs to runs/<experiment> folder.

Use the evaluate.py script for calculating metrics. Example:

python inference/evaluate.py --root_path ./data/hypersim/hypersim001008 --exp_path runs/<experiment>

Training 👩🏽‍💻

This repository only contains the inference code for MVC-PSU. The training code will be released in our subsequent work.

Cite this work

@inproceedings{liu2025multi,
  title={Multi-view Consistent 3D Panoptic Scene Understanding},
  author={Liu, Xianzhu and Sun, Xin and Xie, Haozhe and Li, Zonglin and Li, Ru and Zhang, Shengping},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={6},
  pages={5613--5621},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
config		config
dataset		dataset
images		images
inference		inference
model		model
resources		resources
third_party		third_party
util		util
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multi-view Consistent 3D Panoptic Scene Understanding

Changelog

Installation 📥

Datasets and Pretrained Models 🛢️

Inference and Evaluation 🚩

Training 👩🏽‍💻

Cite this work

About

Uh oh!

Releases

Packages

Uh oh!

Languages

aipixel/MVC-PSU

Folders and files

Latest commit

History

Repository files navigation

Multi-view Consistent 3D Panoptic Scene Understanding

Changelog

Installation 📥

Datasets and Pretrained Models 🛢️

Inference and Evaluation 🚩

Training 👩🏽‍💻

Cite this work

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages