Feat: Add SVE (Scalable Vector Extension) support for SimdUtil-inl.h #59

KAlbert2333 · 2025-12-25T11:07:08Z

What problem does this PR solve?

Issue Number: close #54

Type of Change

🐛 Bug fix (non-breaking change which fixes an issue)
✨ New feature (non-breaking change which adds functionality)
🚀 Performance improvement (optimization)
⚠️ Breaking change (fix or feature that would cause existing functionality to change)
🔨 Refactoring (no logic changes)
🔧 Build/CI or Infrastructure changes
📝 Documentation only

Description

Added an xsimd extension for SVE and implemented SVE-optimized versions of some SIMD functions.

Performance Impact

No Impact: This change does not affect the critical path (e.g., build system, doc, error handling).
Positive Impact: I have run benchmarks.
Benchmark Results
```
TPCDS99 1T results.
Before: 1565s
After: 1530s  (about 2.2% faster)
```
Negative Impact: Explained below (e.g., trade-off for correctness).

Release Note

Please describe the changes in this PR

Release Note:
- Added an xsimd extension for SVE and vectorized implementations of functions such as BitMask, Gather, MaskGather, Pack32, Permute, and Filter.

Checklist (For Author)

I have added/updated unit tests (ctest).
I have verified the code with local build (Release/Debug).
I have run clang-format / linters.
(Optional) I have run Sanitizers (ASAN/TSAN) locally for complex C++ changes.
No need to test or manual test.

Breaking Changes

No

Yes (Description: ...)

Breaking Changes:
- Description of the breaking change.
- Possible solutions or workarounds.
- Any other relevant information.

Added SVE support for various SIMD operations and updated function signatures to use int64_t instead of int32_t for better compatibility with larger data sizes.

CLAassistant · 2025-12-25T11:08:47Z

All committers have signed the CLA.

fzhedu · 2026-01-04T06:18:16Z

Can you add some tests for the SIMD code? Also, it would be better to report the performance gain relative to the scalar code.

bolt/common/base/SimdUtil.h

kexianda · 2026-01-04T07:09:51Z

conanfile.py

-            # Support CRC & NEON on ARMv8
-            flags = f"{self.BOLT_GLOABL_FLAGS} -march=armv8.3-a"
+            # Support CRC & NEON & SVE on ARMv8
+            flags = f"{self.BOLT_GLOABL_FLAGS} -march=armv8.3-a+sve -msve-vector-bits=256 -DSVE_BITS=256"


Won't this fail on older ARM platforms that only support NEON?
Is there a compiler flag to detect this? The hard-coded preprocessor macro SVE_BITS will not work on all hardware platforms.

> Won't this fail on older ARM platforms that only support NEON? Is there a compiler flag to detect this? The hard-coded preprocessor macro SVE_BITS will not work on all hardware platforms.

Added an `lscpu` check to determine whether the current CPU supports the `sve` instruction set.
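For reference, a minimal sketch of how such an `lscpu` check might look in Python. The helper names (`cpu_flags_from_lscpu`, `arm_march_flags`, `read_lscpu`) are hypothetical and are not taken from this PR's conanfile.py; the flag strings are copied from the diff above. It assumes `lscpu` prints a `Flags:` line listing the CPU features as whitespace-separated tokens.

```python
import subprocess

# Flag strings copied from the conanfile.py diff in this thread.
BASE_MARCH = "-march=armv8.3-a"
SVE_SUFFIX = "+sve -msve-vector-bits=256 -DSVE_BITS=256"


def cpu_flags_from_lscpu(lscpu_output: str) -> set:
    """Return the tokens on the 'Flags:' line of lscpu output (hypothetical helper)."""
    for line in lscpu_output.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "flags":
            return set(value.split())
    return set()


def arm_march_flags(lscpu_output: str) -> str:
    """Pick -march flags; add the SVE options only when 'sve' is an exact flag token."""
    # Exact token match: a substring test would also hit tokens such as 'svei8mm'.
    if "sve" in cpu_flags_from_lscpu(lscpu_output):
        return BASE_MARCH + SVE_SUFFIX
    return BASE_MARCH


def read_lscpu() -> str:
    """Run lscpu; return an empty string if it is missing or fails (NEON-only fallback)."""
    try:
        result = subprocess.run(
            ["lscpu"], capture_output=True, text=True, check=True
        )
        return result.stdout
    except (OSError, subprocess.CalledProcessError):
        return ""


if __name__ == "__main__":
    print(arm_march_flags(read_lscpu()))
```

Two caveats with this approach: the check describes the build host, which may differ from the machine that runs the binary, and seeing `sve` in the flags does not confirm a 256-bit vector length, so `-msve-vector-bits=256` remains an assumption about the target hardware.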

KAlbert2333 · 2026-01-15T01:50:58Z

> Can you add some tests for the SIMD code? Also, it would be better to report the performance gain relative to the scalar code.

We will provide a report comparing the scalar and vectorized code soon.

KAlbert2333 added 6 commits December 25, 2025 10:56

SIMD operations with SVE support

0702708

Added SVE support for various SIMD operations and updated function signatures to use int64_t instead of int32_t for better compatibility with larger data sizes.

Add Batch128 struct with various utility functions

fce76e5

Update ColumnVisitors.h

815f2db

Update SimdUtil-inl.h

9c0942d

Update ColumnVisitors.h

44ac41e

Update conanfile.py

d3575c3

KAlbert2333 added 3 commits December 25, 2025 19:09

Update SimdUtil-inl.h

3f6a298

Merge branch 'main' into main

2e54c04

Merge branch 'main' into main

6d1597d

fzhedu self-requested a review January 4, 2026 06:18

kexianda reviewed Jan 4, 2026

yangzhg added the enhancement (New feature or request) and performance (performance improvement needed) labels Jan 9, 2026

KAlbert2333 added 4 commits January 9, 2026 11:11

Add GitHub Actions workflow for clang-format2 checks

c9f66d3

Delete .github/workflows/main.yml

3e06462

Merge branch 'bytedance:main' into main

f93b751

Update conanfile.py

cbd6cf3

Update SimdUtil.h

3e27151

Update conanfile.py

ea47c59

KAlbert2333 mentioned this pull request Jan 15, 2026

Feat:Optimize and iterate the unpacking process #131

Open

17 tasks

Merge branch 'main' into main

a14f884
