[feat] Added vision input example #19

Open
JoeLiu996 wants to merge 4 commits into main from feat/vision_input

@JoeLiu996
Collaborator

  • Adopts the VLM-classifier recipe as the cookbook's vision input example.
  • Training currently supports Qwen3-VL-8b-Instruct only. MoE multimodal models will follow once MoE training is supported.

Training Qwen3-VL-8b-Instruct on the caltech101 dataset:
[Chart: negative log-likelihood loss]
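
For readers unfamiliar with the metric in the chart, here is a minimal sketch of a mean negative log-likelihood over class logits. It is not the recipe's code: the function names `log_softmax` and `nll_loss` are hypothetical, and it assumes the classifier is scored over a fixed label set, whereas the recipe in this PR may instead compute the loss over generated label tokens.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one list of class logits."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum for x in logits]


def nll_loss(batch_logits, targets):
    """Mean negative log-likelihood of the target class across a batch.

    batch_logits: list of per-example logit lists (hypothetical layout).
    targets: list of integer class indices, one per example.
    """
    total = 0.0
    for logits, target in zip(batch_logits, targets):
        total -= log_softmax(logits)[target]
    return total / len(targets)


# A confident correct prediction gives a lower loss than a confident wrong one.
good = nll_loss([[10.0, 0.0]], [0])
bad = nll_loss([[0.0, 10.0]], [0])
```

A model that assigns equal probability to each of K classes has a loss of ln(K), which is a useful reference point when reading the start of a loss curve.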

@JoeLiu996 requested review from CZYCW and TongLi3701 on January 22, 2026 at 09:57