[中文|EN]
Existing methods mainly rely on spatial transformation with sensor calibration information. However, this operation also results in two significant issues: First, the model performance is degraded due to the calibration noise (e.g., changing of sensor position during driving ). Second, the spatial transformation is computationally expensive and limits the efficiency of existing methods. In this work, we exploit a calibration-free spatial transformation scheme based on the vanilla attention scheme, to project the 2D images and point clouds into a compact BEV plane. Besides, we also introduce extra auxiliary training tasks and an efficient decoder to improve the model performance and efficiency, respectively.
This work is based on our prior work, i.e., PMF. If you are familiar with our PMF project, you can run this project easily.
The code of this project will be released after our work is finally accepted.
@article{zhuang2024robust3dsemanticoccupancy,
title={Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation},
author={Zhuangwei Zhuang and Ziyin Wang and Sitao Chen and Lizhao Liu and Hui Luo and Mingkui Tan},
journal={arXiv preprint arXiv:2411.12177},
year={2024}
}