Added a small verification check in `benchmark.py` to catch the "Quantization Paradox".
Sometimes INT8 models actually run slower than FP32 on certain ARM targets due to missing dot-product extensions or ORT threading overhead.
The loop now tracks the mean latency for `fp32`, `fp16`, and `int8`. If you pass `--paradox_strict`, the build fails when the INT8 model regresses in latency relative to FP32, preventing us from merging measurably slower quantized models.
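A rough sketch of the shape of the check, assuming the harness collects per-precision mean latencies into a dict; `measure_mean_latency` and `check_quantization_paradox` are illustrative names, not the exact helpers in this PR:

```python
import argparse
import statistics
import time


def measure_mean_latency(run_fn, warmup=5, iters=50):
    """Time `run_fn` over several iterations and return the mean latency (ms)."""
    for _ in range(warmup):
        run_fn()
    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        run_fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.mean(samples)


def check_quantization_paradox(mean_latency_ms, strict):
    """Warn, or fail in strict mode, if INT8 is slower than FP32."""
    fp32 = mean_latency_ms.get("fp32")
    int8 = mean_latency_ms.get("int8")
    if fp32 is None or int8 is None:
        return  # nothing to compare on this run
    if int8 > fp32:
        msg = (f"Quantization paradox detected: int8 mean latency "
               f"{int8:.2f} ms > fp32 {fp32:.2f} ms")
        if strict:
            raise SystemExit(msg)  # non-zero exit, so CI fails the build
        print(f"WARNING: {msg}")


# The flag is wired through argparse in the benchmark entry point:
parser = argparse.ArgumentParser()
parser.add_argument("--paradox_strict", action="store_true",
                    help="Fail the run if the INT8 model is slower than FP32.")
```

Raising `SystemExit` (rather than just printing) is what gives the non-zero exit code in strict mode, so the regression blocks the merge instead of scrolling past in the logs.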
Note: I also patched a silent bug in `Benchmark.run` where `_benchmark_results_brief` kept accumulating results across different models rather than resetting, which was mixing up the stats.
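The fix amounts to clearing the accumulator at the top of `run` instead of relying on the value set in `__init__`; the class shape below is a hypothetical reconstruction for illustration, not the actual code:

```python
class Benchmark:
    def __init__(self):
        self._benchmark_results_brief = {}

    def run(self, model_name, precisions=("fp32", "fp16", "int8")):
        # Previously this dict was only initialized in __init__, so results
        # from earlier models leaked into later runs and skewed the stats.
        # Resetting it here keeps each model's summary independent.
        self._benchmark_results_brief = {}
        for precision in precisions:
            self._benchmark_results_brief[precision] = self._bench_one(
                model_name, precision
            )
        return self._benchmark_results_brief

    def _bench_one(self, model_name, precision):
        # Stand-in for the real timing loop over the loaded model.
        raise NotImplementedError
```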