- Add support for converting HuggingFace GPTQ-int4 models (requires groupsize to be 32, 64, or 128, and desc_act set to false).
- Add support for TeleChat/TeleChat2/MiniCPM-S models.
- Support exporting llm model in Qwen2VL
- Resolve issues with LoRA inference.
- Fix an import error related to IPython.