Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RKYolo 预测推理耗时不断增加 #2596

Open
erroot opened this issue Mar 10, 2025 · 0 comments
Open

RKYolo 预测推理耗时不断增加 #2596

erroot opened this issue Mar 10, 2025 · 0 comments
Assignees

Comments

@erroot
Copy link

erroot commented Mar 10, 2025


温馨提示:根据社区不完全统计,按照模板提问,可以加快回复和解决问题的速度


环境

  • 【FastDeploy版本】: 说明具体的版本,如fastdeploy-linux-rknn2-0.0.3

  • 【编译命令】如果您是自行编译的FastDeploy,cmake .. -DENABLE_ORT_BACKEND=ON
    -DENABLE_RKNPU2_BACKEND=ON -DENABLE_VISION=ON -DRKNN2_TARGET_SOC=RK3588 -DCMAKE_INSTALL_PREFIX=${PWD}/fastdeploy-0.0.3

  • 【系统平台】: Linux x64(Ubuntu 18.04)

  • 【硬件】: RK3588

  • 【编译语言】: C++

  • 【性能问题】描述清楚对比的方式
    /opt/pp/FastDeploy/fastdeploy/vision/detection/contrib/rknpu2/rkyolo.cc
    同一个(fastdeploy::vision::detection::RKYOLOV5对象实例 RKYOLO 多次推理耗时不断增加从100多毫秒不断增加4秒多离了大普
    调试打印发现:
    void RKYOLOPreprocessor::LetterBox(FDMat* mat) {
    ...
    scale_.push_back(scale);
    pad_hw_values_.push_back({pad_h, pad_w});
    }
    两个向量scale_,pad_hw_values_ 没有清理的地方,向量大小会不断增长,隐藏引起对象拷贝耗时不断增加

    postprocessor_.SetPadHWValues(preprocessor_.GetPadHWValues());
    postprocessor_.SetScale(preprocessor_.GetScale());

rkyolo.cc(109)::BatchPredict RKYOLO sn:11232,Infer:0.057892 s
rkyolo.cc(109)::BatchPredict RKYOLO sn:11232,Infer:0.075516 s
rkyolo.cc(109)::BatchPredict RKYOLO sn:11232,Infer:0.092575 s
rkyolo.cc(114)::BatchPredict RKYOLO sn:11232,GetPadHWValues time:0.067186 s,size:3760
rkyolo.cc(114)::BatchPredict RKYOLO sn:11232,GetPadHWValues time:0.065356 s,size:3727
rkyolo.cc(114)::BatchPredict RKYOLO sn:11232,GetPadHWValues time:0.067844 s,size:3750
rkyolo.cc(119)::BatchPredict RKYOLO sn:11232,SetPadHWValues: GetPadHWValues time:0.260485 s,size:3760
rkyolo.cc(125)::BatchPredict RKYOLO sn:11232,GetScale & SetScale time:0.000106 s,size:3760
rkyolo.cc(119)::BatchPredict RKYOLO sn:11232,SetPadHWValues: GetPadHWValues time:0.341411 s,size:3727
rkyolo.cc(136)::BatchPredict RKYOLO sn:11232,postprocessor_:0.028234 s
rkyolo.cc(139)::BatchPredict RKYOLO sn:11232,all:0.665357 s
rkyolo.cc(119)::BatchPredict RKYOLO sn:11232,SetPadHWValues: GetPadHWValues time:0.476271 s,size:3750
rkyolo.cc(125)::BatchPredict RKYOLO sn:11232,GetScale & SetScale time:0.000125 s, size:3727

两个操作耗时增加
deepseek 给出修改建议:
1 使用局部变量替代成员变量;
2. 清空向量
如果 pad_hw_values_ 和 scale_ 必须在类的成员变量中保留,可以在每次调用 Preprocess 或 LetterBox 之前清空它们,以避免它们不断增长。
3. 使用引用传递
在 SetPadHWValues 和 SetScale 中,可以使用引用传递来避免不必要的拷贝。

@erroot erroot changed the title RKYolo 预测推理时间不端增加 RKYolo 预测推理耗时不断增加 Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants