[CPU][ARM] Limit cases when ACL int8 convolution executor is chosen #33040
base: master
Conversation
v-Golubev left a comment:
Could you please extend the existing tests by adding a test case with per-channel dequantization?
Added per-channel test case.
src/plugins/intel_cpu/tests/functional/custom/subgraph_tests/src/arm/conv_fq.cpp
v-Golubev left a comment:
LGTM
```cpp
ov::element::Type expectedPrecision = element::f32;
#if defined(OPENVINO_ARCH_ARM64)
const auto& [inputShape, inputPrecision, quantizeIntervals, fqConstShapes, targetName] = this->GetParam();
if (fqConstShapes.empty()) {
    expectedPrecision = quantizeIntervals[0][0] < 0.f ? element::i8 : element::u8;
}
#endif
```
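The selection logic in the test above can be sketched in isolation. The following is a hypothetical, self-contained model of that check (the function name and types are illustrative, not part of the OpenVINO codebase): when the FakeQuantize constant shapes are empty the quantization is per-tensor, so an int8 convolution executor is expected, with signedness taken from the sign of the interval's low bound; non-empty shapes indicate per-channel dequantization, which falls back to f32.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Illustrative stand-in for the test's expected-precision logic.
// fqConstShapes: shapes of the FakeQuantize constants (empty == per-tensor).
// intervalLow:   low bound of the quantization interval.
std::string expectedConvPrecision(const std::vector<std::vector<int>>& fqConstShapes,
                                  float intervalLow) {
    if (!fqConstShapes.empty()) {
        // Per-channel dequantization: no int8 executor, fall back to f32.
        return "f32";
    }
    // Per-tensor case: a negative low bound implies a signed int8 range.
    return intervalLow < 0.f ? "i8" : "u8";
}
```

Usage follows the three cases the test parameterization distinguishes: signed per-tensor, unsigned per-tensor, and the per-channel fallback.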
Could you please add a short comment on why we expect these precisions? The main thing I'd expect to see here is a mention that we don't support per-channel dequantization for quantized convolution, so in that case we fall back to the f32 implementation.
Details:
Tickets: