
@andrew-k-park
Contributor

Details:

  • When an FP16 dynamic convolution has few input channels (≤ 4) and many output channels (e.g., 1024), the current format selection logic chooses bfyx → fsv16, which triggers oneDNN's reference kernel instead of an optimized JIT kernel, causing significant performance degradation.
  • Fix: override the output format to planar (bfyx) when the input channel count is small (≤ 16) and the output channel count is large (≥ 32).
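The proposed override can be sketched roughly as follows. This is an illustrative model of the heuristic described above, not the actual plugin code; the enum and function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical names for illustration only.
enum class Format { bfyx, b_fs_yx_fsv16 };

// Proposed heuristic: in channel-expansion cases (few input features,
// many output features), writing an fsv16 output falls back to oneDNN's
// reference kernel, so prefer the planar bfyx layout instead.
Format select_conv_output_format(std::int64_t input_channels,
                                 std::int64_t output_channels,
                                 Format preferred) {
    if (input_channels <= 16 && output_channels >= 32)
        return Format::bfyx;  // override: planar output
    return preferred;         // otherwise keep the preferred format
}
```

For the case in this PR, a 3 → 1024 convolution would now get a planar (bfyx) output instead of fsv16.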

Current behavior:

  • Input: 3 channels → converted to bfyx
  • Output: 1024 channels → remains fsv16 (the output format is only switched to planar when it has ≤ 4 channels)
  • Result: the bfyx → fsv16 combination falls back to the reference kernel (slow)
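The existing behavior described above can be sketched as follows (illustrative names, not the actual plugin code): the output format is only switched to planar when the output channel count itself is tiny, so a 1024-channel output keeps fsv16 regardless of how few input channels there are.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical names for illustration only.
enum class Format { bfyx, b_fs_yx_fsv16 };

// Current behavior: the input channel count is not considered at all;
// the output stays fsv16 unless the output itself is very small (<= 4).
Format current_output_format(std::int64_t output_channels, Format preferred) {
    if (output_channels <= 4)
        return Format::bfyx;  // tiny outputs fall back to planar
    return preferred;         // 1024 channels: fsv16 is kept
}
```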

Root cause:

The fsv16 blocked format is optimized for reading many channels but introduces overhead when used for writing outputs in channel-expansion scenarios (small input → large output). oneDNN's reference kernel is selected because:

  1. Inefficient write pattern: an fsv16 output requires interleaved writes every 16 elements (non-contiguous)
  2. No optimized implementation: oneDNN does not provide a JIT-optimized kernel that produces fsv16 output from a small number of input channels
  3. Scatter-write overhead: writing 1024 channels in fsv16 format requires complex block-strided access
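The write-pattern difference can be made concrete with simplified offset formulas for the two layouts (single batch, strides assumed for b = 1; this is a sketch of the layout math, not plugin code). In bfyx, each output channel's spatial plane is contiguous; in fsv16, consecutive spatial elements of one channel are 16 elements apart, and crossing a 16-channel block boundary jumps by a full Y·X·16 block.

```cpp
#include <cassert>
#include <cstdint>

// Illustrative spatial size (assumed, not from the PR).
constexpr std::int64_t X = 7, Y = 7;

// Planar layout: feature planes are contiguous in memory.
std::int64_t offset_bfyx(std::int64_t f, std::int64_t y, std::int64_t x) {
    return f * Y * X + y * X + x;
}

// Blocked layout: features are grouped into blocks of 16 (fsv = f % 16),
// so spatial neighbors of one channel sit 16 elements apart.
std::int64_t offset_fsv16(std::int64_t f, std::int64_t y, std::int64_t x) {
    const std::int64_t fs = f / 16, fsv = f % 16;
    return ((fs * Y + y) * X + x) * 16 + fsv;
}
```

Writing one channel's row in bfyx touches consecutive addresses, while the same row in fsv16 strides by 16; with 1024 output channels that means 64 separate fsv blocks, each a large stride apart, which matches the block-strided access pattern described above.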

Tickets:

…ge channel expansion

Signed-off-by: Andrew Park <andrew.park@intel.com>
@andrew-k-park andrew-k-park requested review from a team as code owners December 5, 2025 07:08
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Dec 5, 2025
