Conversation

@HyperExtendedReality HyperExtendedReality commented Jan 15, 2026

- Add automatic detection and default to `bfloat16` (or `fp16` fallback) when no explicit dtype is provided, based on device capabilities (see the sketch below)
- Respect provided `dtype_llama`/`dtype` consistently across the Gemma model, projection layer, and connectors
- Remove the forced `out.float()` in `encode_token_weights` to prevent downgrading to fp32 after projection
- This allows SageAttention's optimized kernel to run instead of falling back to PyTorch attention

Fixes the warning:
"Error running sage attention: Input tensors must be in dtype of torch.float16 or torch.bfloat16, using pytorch attention instead."
