[feat] Added vision input example by JoeLiu996 · Pull Request #19 · hpcaitech/HPC-AI-SDK

JoeLiu996 · 2026-01-22T09:56:26Z

Adopted VLM-classifier recipe as cookbook vision input example
Currently support Qwen3-VL-8b-Instruct for training, will add more MoE multi-modal models in the future after supporting MoE training

Training Qwen3-VL-8b-Instruct on caltech101 dataset:
Negative Log Likelihood Loss:

support vision input for Qwen3-VL

6a96993

JoeLiu996 requested review from CZYCW and TongLi3701 January 22, 2026 09:57

JoeLiu996 added 3 commits January 27, 2026 07:38

fix bugs

852800b

update gitignore

dd13525

Added image_chunk_param

5aea5c5

Provide feedback