More CRC-32 and Adler-32 updates #353

ebiggers · 2024-03-10T23:59:07Z

test_checksums: increase number of long inputs tested
lib/{adler32,crc32}: misc cleanups
lib/x86/crc32: more optimizations

Various cleanups, including tweaks to make the Adler-32 code more consistent with the CRC-32 code and vice versa. No behavior changes.

- As was recently done in the Adler-32 code, take advantage of the fact that on recent x86 processors, vmovdqu with an aligned pointer is just as fast as vmovdqa. Don't waste time aligning the pointer unless the length is very large, and at the same time, handle all cases of len >= 8*VL using the main loop so that the 4*VL wide loop isn't needed. (Before, aligning the pointer was tied to whether the main loop was used or not, since the main loop used vmovdqa.) - Handle short lengths more efficiently. Instead of falling back to crc32_slice1() for all len < VL, use AVX-512 masking (when available) to handle 4 <= len <= 15, and use 128-bit vector instructions to handle 16 <= len < VL. - Document why the main loop uses a width of 8*VL instead of 4*VL.

test_checksums: increase number of long inputs tested

511893f

ebiggers force-pushed the dev branch from 0c7911b to 5ea3e43 Compare March 11, 2024 07:17

ebiggers added 2 commits March 12, 2024 00:31

lib/{adler32,crc32}: misc cleanups

8ae3a19

Various cleanups, including tweaks to make the Adler-32 code more consistent with the CRC-32 code and vice versa. No behavior changes.

ebiggers force-pushed the dev branch from 5ea3e43 to 5d15bce Compare March 12, 2024 07:40

ebiggers merged commit 5d15bce into master Mar 16, 2024
52 checks passed

ebiggers deleted the dev branch March 16, 2024 18:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More CRC-32 and Adler-32 updates #353

More CRC-32 and Adler-32 updates #353

ebiggers commented Mar 10, 2024 •

edited

Loading

More CRC-32 and Adler-32 updates #353

More CRC-32 and Adler-32 updates #353

Conversation

ebiggers commented Mar 10, 2024 • edited Loading

ebiggers commented Mar 10, 2024 •

edited

Loading