Skip to content

Commit

Permalink
Add default quantization int4 config for Mixtral-8x7B (#576)
Browse files Browse the repository at this point in the history
  • Loading branch information
ljaljushkin authored Feb 28, 2024
1 parent 2d14e25 commit dcf82af
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions optimum/intel/openvino/configuration.py
Original file line number Diff line number Diff line change
Expand Up @@ -96,6 +96,7 @@
"openlm-research/open_llama_3b": {"bits": 4, "sym": True, "group_size": 64, "all_layers": True},
"tiiuae/falcon-7b": {"bits": 4, "sym": True, "group_size": 64, "all_layers": True},
"psmathur/orca_mini_3b": {"bits": 4, "sym": True, "group_size": 64, "all_layers": True},
"mistralai/Mixtral-8x7B-v0.1": {"bits": 4, "sym": True, "group_size": 128, "ratio": 0.8},
}


Expand Down

0 comments on commit dcf82af

Please sign in to comment.