Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use AVX-512 when available #1

Closed
wants to merge 1 commit into from
Closed

Use AVX-512 when available #1

wants to merge 1 commit into from

Conversation

Shnatsel
Copy link

@Shnatsel Shnatsel commented Nov 29, 2023

On Zen 4 this increases the improvement over the base64 crate from 3.5x to 4.5x

@Shnatsel
Copy link
Author

I haven't tested this (other than benchmarking) so it is entirely possible that the handling of the remainder needs to be changed too.

@Shnatsel
Copy link
Author

Actually, I've re-measured and it seems to be a measurement error on my part. Both the avx2 and avx512 modes produce the exact same performance.

It seems that the avx512 target feature is not yet stable, and is ignored: rust-lang/rust#44839
Or maybe it's just the Zen4 double-pumped implementation of AVX-512 not being any different than the AVX2. I'd have to look at the disassembly to confirm. I'll close this until then.

@Shnatsel Shnatsel closed this Nov 29, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant