[Discussion] the bit unpacking performance #11000
zombee0
started this conversation in
Show and tell
Replies: 2 comments 8 replies
-
I just rerun the benchmark and arrow is only a little bit faster (< 13%) in |
Beta Was this translation helpful? Give feedback.
6 replies
-
Another thing I observed is the datainput size is important, the arrow algorithm might be good for some case that input is long enough. Otherwise the bmi2 would be faster |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
arrow-avx2.txt
arrow-none.txt
I like the job of #3000 #2353, but i can't reproduce your result,
there are two files, arrow_none.txt is the result of default compile configuration
while arrow_avx2.txt is the result of enable avx2 for arrow,
for result of uint16 and uint32, I found that arrow-avx2 performs better.
my test is done on Intel(R) Xeon(R) Platinum 8269CY CPU @ 2.50GHz
would you be willing to help confirm this? @yingsu00 @Yuhta
Beta Was this translation helpful? Give feedback.
All reactions