SpQR Quantization? ... for Near-Lossless LLM Weight Compression #2061
ianscrivener
started this conversation in
Ideas
Replies: 1 comment 3 replies
-
#1602 (comment) |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
almost sounds too good to be true... but this technique even makes sense to the layman
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
https://arxiv.org/abs/2306.03078
https://github.com/Vahe1994/SpQR
via: https://www.superdatascience.com/podcast/near-lossless-llm-quantization
Beta Was this translation helpful? Give feedback.
All reactions