Skip to content

v1.12.0: AutoGPTQ integration, extended BetterTransformer support

Compare
Choose a tag to compare
@fxmarty fxmarty released this 23 Aug 12:27
· 387 commits to main since this release

AutoGPTQ integration

Part of AutoGPTQ library has been integrated in Optimum, with utilities to ease the integration in other Hugging Face libraries. Reference: https://huggingface.co/docs/optimum/llm_quantization/usage_guides/quantization

Extended BetterTransformer support

BetterTransformer now supports BLOOM and GPT-BigCode architectures.

Other changes and bugfixes

New Contributors

Full Changelog: v1.11.2...v1.12.0