Checksums in AVX-512, AVX2, NEON #199

ashvardanian · 2024-12-01T08:32:59Z

🆕 sz_checksum(char const *, size_t) C 99 interface
🆕 sz::str().checksum() C++ 11 interface
🆕 sz.checksum(str) Python interface

Database and other Systems Engineers, you can now use StringZilla to dynamically dispatch different check-sum kernels for AVX2 capable Haswell+ CPUs, AVX-512BW capable Ice Lake+ CPUs, and Arm NEON CPUs on mobile. In AVX-512, masked loads are used extensively, resulting in a 10% improvement even on typical English words, averaging 5 bytes in length and 20x performance improvement compared to the serial code for longer strings.

On the technical side, on x86, the kernels use the well-known SAD(text, zeros) idiom to accumulate absolute differences between individual bytes into 64-bit words. It also uses bidirectional traversal to saturate the core, capable of performing 2 loads per CPU cycle. Moreover, on large inputs, it switches to streaming loads, separately handling the head and the tail, similar to our memcpy alternative, also outperforming LibC on AVX-512-capable machines 😎

AVX-512, AVX2, NEON

ashvardanian added 2 commits December 1, 2024 08:18

Add: Checksum kernels

a99337b

AVX-512, AVX2, NEON

Add: Checksum tests

c2b997c

ashvardanian force-pushed the main-dev branch from f6c29da to e0a9e4e Compare December 1, 2024 09:25

Fix: Missing _mm_cvtsi128_si64x in Clang

c8c6c7c

ashvardanian force-pushed the main-dev branch from e0a9e4e to c8c6c7c Compare December 1, 2024 09:36

ashvardanian added 3 commits December 1, 2024 09:50

Fix: sz_checksum visibility

9bec0eb

Add: Checksums in Python

1b77de9

Docs: Simpler Python doc-strings

ad5fa2c

ashvardanian merged commit d528548 into main Dec 1, 2024
40 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checksums in AVX-512, AVX2, NEON #199

Checksums in AVX-512, AVX2, NEON #199

ashvardanian commented Dec 1, 2024 •

edited

Loading

Checksums in AVX-512, AVX2, NEON #199

Checksums in AVX-512, AVX2, NEON #199

Conversation

ashvardanian commented Dec 1, 2024 • edited Loading

ashvardanian commented Dec 1, 2024 •

edited

Loading