简体中文 | English

MobileSeg

MobileSeg系列模型采用编解码架构，使用轻量级的模型作为骨干网络，适合部署在X86 CPU、ARM CPU等算量低的硬件。

参考论文

Sandler, Mark, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. "Mobilenetv2: Inverted residuals and linear bottlenecks." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510-4520. 2018.

Howard, Andrew, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang et al. "Searching for mobilenetv3." In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314-1324. 2019.

Ma, Ningning, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. "Shufflenet v2: Practical guidelines for efficient cnn architecture design." In Proceedings of the European conference on computer vision (ECCV), pp. 116-131. 2018.

Yu, Changqian, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, and Jingdong Wang. "Lite-hrnet: A lightweight high-resolution network." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10440-10450. 2021.

Han, Kai, Yunhe Wang, Qi Tian, Jianyuan Guo, Chunjing Xu, and Chang Xu. "Ghostnet: More features from cheap operations." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1580-1589. 2020.

分割精度

Cityscapes数据集

Model	Backbone	Resolution	Training Iters	mIoU	mIoU (flip)	mIoU (ms+flip)	Links
MobileSeg	MobileNetV2	1024x512	80000	73.94%	74.32%	75.33%	model \| log \| vdl
MobileSeg	MobileNetV3_large_x1_0	1024x512	80000	73.47%	73.72%	74.70%	model \| log \| vdl
MobileSeg	Lite_HRNet_18	1024x512	80000	70.75%	71.62%	72.40%	model \| log \| vdl
MobileSeg	ShuffleNetV2_x1_0	1024x512	80000	69.46%	70.00%	70.90%	model \| log \| vdl
MobileSeg	GhostNet_x1_0	1024x512	80000	71.88%	72.22%	73.11%	model \| log \| vdl

预测速度

Model	Backbone	V100 TRT Inference Speed(FPS)	Snapdragon 855 Inference Speed(FPS)
MobileSeg	MobileNetV2	67.57	27.01
MobileSeg	MobileNetV3_large_x1_0	67.39	32.90
MobileSeg	Lite_HRNet_18	10.5	13.05
MobileSeg	ShuffleNetV2_x1_0	37.09	39.61
MobileSeg	GhostNet_x1_0	35.58	38.74

测试条件：

在Nvidia GPU V100上测试模型速度：使用PaddleInference Python接口，开启TensorRT，推理数据类型是FP32，输入图像的维度是1x3x1024x2048。
在小米9（骁龙855 CPU）上测试模型速度：使用PaddleLite C++接口，单线程，输入图像的维度是1x3x256x256。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_cn.md

README_cn.md

MobileSeg

参考论文

分割精度

Cityscapes数据集

预测速度

Files

README_cn.md

Latest commit

History

README_cn.md

File metadata and controls

MobileSeg

参考论文

分割精度

Cityscapes数据集

预测速度