Xianzhu Liu1, Xin Sun1, Haozhe Xie2, Zonglin Li1, Ru Li1, Shengping Zhang1
1Harbin Institute of Technology, Weihai, China 2Nanyang Technological University, Singapore
- [2024/12/19] The repo is created.
- [2025/5/29] The test code has been released.
Moreover, this repository introduces an integrated 3D Panoptic Scene Understanding Benchmark implemented in Python 3.8, PyTorch 1.12 and CUDA 11.3.
- You can use the following command to install PyTorch with CUDA 11.3.
conda create -n ssc python=3.8
conda activate ssc
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
- Install dependencies:
pip install -r requirements.txt
For the datasets used in this paper, please refer to the download and preprocessing instructions provided in Panoptic-Lifting
You can download our pre-trained checkpoints here
We provide an example to use our code.
-
Please download the pretrained checkpoints and unzip.
-
Use the
render_panopli.py
script to render. Example:
python inference/render_panopli.py pretrained_ckpts/hypersim001008/checkpoints/hypersim001008.ckpt True
This will render the outputs to runs/<experiment>
folder.
- Use the
evaluate.py
script for calculating metrics. Example:
python inference/evaluate.py --root_path ./data/hypersim/hypersim001008 --exp_path runs/<experiment>
This repository only contains the inference code for MVC-PSU. The training code will be released in our subsequent work.
@inproceedings{liu2025multi,
title={Multi-view Consistent 3D Panoptic Scene Understanding},
author={Liu, Xianzhu and Sun, Xin and Xie, Haozhe and Li, Zonglin and Li, Ru and Zhang, Shengping},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={39},
number={6},
pages={5613--5621},
year={2025}
}