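The MobileSeg series of models uses an encoder-decoder architecture with a lightweight network as the backbone, which makes them suitable for deployment on hardware with limited compute, such as x86 CPUs and ARM CPUs.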
Sandler, Mark, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. "Mobilenetv2: Inverted residuals and linear bottlenecks." In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510-4520. 2018.
Howard, Andrew, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang et al. "Searching for mobilenetv3." In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314-1324. 2019.
Ma, Ningning, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. "Shufflenet v2: Practical guidelines for efficient cnn architecture design." In Proceedings of the European conference on computer vision (ECCV), pp. 116-131. 2018.
Yu, Changqian, Bin Xiao, Changxin Gao, Lu Yuan, Lei Zhang, Nong Sang, and Jingdong Wang. "Lite-hrnet: A lightweight high-resolution network." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10440-10450. 2021.
Han, Kai, Yunhe Wang, Qi Tian, Jianyuan Guo, Chunjing Xu, and Chang Xu. "Ghostnet: More features from cheap operations." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1580-1589. 2020.
Model | Backbone | Resolution | Training Iters | mIoU | mIoU (flip) | mIoU (ms+flip) | Links |
---|---|---|---|---|---|---|---|
MobileSeg | MobileNetV2 | 1024x512 | 80000 | 73.94% | 74.32% | 75.33% | model \| log \| vdl |
MobileSeg | MobileNetV3_large_x1_0 | 1024x512 | 80000 | 73.47% | 73.72% | 74.70% | model \| log \| vdl |
MobileSeg | Lite_HRNet_18 | 1024x512 | 80000 | 70.75% | 71.62% | 72.40% | model \| log \| vdl |
MobileSeg | ShuffleNetV2_x1_0 | 1024x512 | 80000 | 69.46% | 70.00% | 70.90% | model \| log \| vdl |
MobileSeg | GhostNet_x1_0 | 1024x512 | 80000 | 71.88% | 72.22% | 73.11% | model \| log \| vdl |
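For readers unfamiliar with the mIoU columns, the following is a minimal sketch of one common way to compute mean intersection-over-union from flattened prediction and label maps. It is not taken from the evaluation code behind the table; the function name `mean_iou`, the `ignore_index=255` default, and the choice to average only over classes that actually occur are assumptions made for illustration.

```python
def mean_iou(pred, label, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in the prediction or the label.

    pred and label are flat sequences of integer class ids of equal length.
    ignore_index is an assumed convention for pixels excluded from scoring.
    """
    intersect = [0] * num_classes
    pred_area = [0] * num_classes
    label_area = [0] * num_classes
    for p, t in zip(pred, label):
        if t == ignore_index:
            continue  # ignored pixels contribute to no class
        pred_area[p] += 1
        label_area[t] += 1
        if p == t:
            intersect[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_area[c] + label_area[c] - intersect[c]
        if union > 0:  # skip classes absent from both maps
            ious.append(intersect[c] / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: a flattened 2x3 image with two classes.
pred = [0, 0, 1, 1, 1, 0]
label = [0, 1, 1, 1, 0, 0]
print(mean_iou(pred, label, num_classes=2))  # 0.5
```

In the toy example each class has 2 correctly predicted pixels and a union of 4, so both per-class IoUs are 0.5 and so is their mean.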
Model | Backbone | V100 TRT Inference Speed (FPS) | Snapdragon 855 Inference Speed (FPS) |
---|---|---|---|
MobileSeg | MobileNetV2 | 67.57 | 27.01 |
MobileSeg | MobileNetV3_large_x1_0 | 67.39 | 32.90 |
MobileSeg | Lite_HRNet_18 | 10.5 | 13.05 |
MobileSeg | ShuffleNetV2_x1_0 | 37.09 | 39.61 |
MobileSeg | GhostNet_x1_0 | 35.58 | 38.74 |
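Test conditions:
- Speed on an Nvidia V100 GPU: measured with the PaddleInference Python API, with TensorRT enabled, FP32 as the inference data type, and an input image shape of 1x3x1024x2048.
- Speed on a Xiaomi 9 (Snapdragon 855 CPU): measured with the PaddleLite C++ API, using a single thread and an input image shape of 1x3x256x256.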
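As a rough illustration of how an FPS figure like those above can be obtained, the sketch below times repeated calls to a single-inference function after a warm-up phase. It is not the benchmark script used for the table: `measure_fps`, `dummy_inference`, and the warm-up and repeat counts are hypothetical, and the dummy workload only stands in for a real forward pass on a predictor.

```python
import time


def measure_fps(run_once, warmup=10, repeats=50):
    """Return frames per second for a callable that runs one inference.

    warmup and repeats are assumed values, not those used for the table.
    """
    for _ in range(warmup):
        run_once()  # warm-up runs are excluded from timing

    start = time.perf_counter()
    for _ in range(repeats):
        run_once()
    elapsed = time.perf_counter() - start
    return repeats / elapsed


def dummy_inference():
    # Hypothetical stand-in for one forward pass on a fixed-size input.
    sum(i * i for i in range(10_000))


fps = measure_fps(dummy_inference, warmup=2, repeats=5)
print(f"{fps:.2f} FPS")
```

When timing a real GPU predictor, `run_once` should return only after the output has been copied back to the host, since otherwise asynchronous execution may make the measured time too short.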