Commit
update int_mm minimal pytorch version
faaany authored and dacorvo committed Nov 12, 2024
1 parent 002ece8 commit ad9d6f6
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion optimum/quanto/library/qbytes_mm.py
@@ -92,7 +92,7 @@ def qbytes_mm_impl_cuda(activations: torch.Tensor, weights: torch.Tensor, output
 def qbytes_mm_impl_cpu(activations: torch.Tensor, weights: torch.Tensor, output_scales: torch.Tensor) -> torch.Tensor:
     if (
         # FIXME: accuracy issues with 2.4.x
-        version.parse(torch.__version__).release > version.parse("2.5.0").release
+        version.parse(torch.__version__).release >= version.parse("2.6.0").release
         and activations.dtype == torch.int8
         and weights.dtype == torch.int8
     ):
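
The change swaps a strict `> 2.5.0` comparison for `>= 2.6.0` on the release tuple of the installed PyTorch version. The sketch below illustrates how the two conditions differ for a few version strings. It assumes `version` in the diff is `packaging.version`, and it uses a hypothetical stdlib-only helper, `release_tuple`, as a stand-in for `version.parse(s).release`; the helper only handles plain version strings (no epochs or `v` prefixes). `old_gate` and `new_gate` are likewise illustrative names and are not part of the repository.

```python
import re


def release_tuple(version_string: str) -> tuple[int, ...]:
    """Hypothetical stand-in for `packaging.version.parse(s).release`.

    Keeps only the leading dotted-integer segment, so suffixes such as
    ".dev20241112" or "+cu121" are ignored.
    """
    match = re.match(r"\d+(?:\.\d+)*", version_string)
    if match is None:
        raise ValueError(f"no release segment in {version_string!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def old_gate(torch_version: str) -> bool:
    # Condition before the commit: strictly newer than 2.5.0.
    return release_tuple(torch_version) > (2, 5, 0)


def new_gate(torch_version: str) -> bool:
    # Condition after the commit: 2.6.0 or newer.
    return release_tuple(torch_version) >= (2, 6, 0)


if __name__ == "__main__":
    # Example version strings chosen for illustration only.
    for v in ("2.4.1", "2.5.0", "2.5.1+cu121", "2.6.0.dev20241112", "2.6.0"):
        print(f"{v:>20}  old={old_gate(v)}  new={new_gate(v)}")
```

Under these assumptions, a patch release such as 2.5.1 satisfies the old condition but not the new one. Because only the release tuple is compared, a pre-release build whose release segment is already 2.6.0 satisfies the new condition as well.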