kv and output scale loading bug -- FIX by amirumoAMD · Pull Request #146 · ROCm/ATOM

amirumoAMD · 2026-01-16T21:31:04Z

Motivation

Aim was to create a proper solution that didn't just skip over the parameter for kv_scale or output_scale in LLFP4 or LLFP8,
and loaded each parameter properly.

Technical Details

Small changes to attention_mha to have k_scale and v_scale load as parameters not just NoneType, functions to remap the parameter names for the inputted tensors, and handling weight loading.

Test Plan

Trace looks normal, and lm_eval results match vllm lm_eval results with the same command used.

Test Result

Testing passed.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

atom/model_ops/attention_mha.py

ChuanLi1101

Mostly good with minor suggestions.

atom/model_ops/attention_mha.py

…found

amirumoAMD and others added 7 commits January 16, 2026 20:10

fix for row parallel linear parameters

e7a0bb8

minor fix

b75302b

changes for llfp8 handling

0b4a890

quick test to see if this was the CI testing issue

5db6edc

cleanup and handling for static scales outside of fp8

6bfc5a2

Merge branch 'main' into amemoore/llama/kv-scale-load-fix

80ea3e7

cleanup

4771aa9

valarLip reviewed Jan 20, 2026

View reviewed changes

atom/model_ops/attention_mha.py Show resolved Hide resolved

ChuanLi1101 reviewed Jan 20, 2026

View reviewed changes

atom/model_ops/attention_mha.py Show resolved Hide resolved

atom/model_ops/attention_mha.py Show resolved Hide resolved

amirumoAMD and others added 2 commits January 20, 2026 17:59

clarifying comment. also no k_scale or v_scale is/is not None checks …

eb715ae

…found

Merge branch 'main' into amemoore/llama/kv-scale-load-fix

d83a311

amirumoAMD requested a review from ChuanLi1101 January 20, 2026 19:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kv and output scale loading bug -- FIX#146

kv and output scale loading bug -- FIX#146
amirumoAMD wants to merge 9 commits intomainfrom
amemoore/llama/kv-scale-load-fix

amirumoAMD commented Jan 16, 2026 •

edited

Loading

Uh oh!

Uh oh!

ChuanLi1101 left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

amirumoAMD commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Uh oh!

ChuanLi1101 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

amirumoAMD commented Jan 16, 2026 •

edited

Loading