Loss is NaN when loading the original Z-Image weights #21

While reproducing TwinFlow training on Z-Image, we found that when the transformer loads the [original weights](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/tree/main), **the loss becomes NaN**. After debugging, we traced the issue to the forward function in `transformer_z_image.py`, where the NaNs are introduced by the expression at lines 95–96: `t_emb = t_emb + t_emb_2 * delta_t_abs.unsqueeze(1)`

<img width="2104" height="920" alt="Image" src="https://github.com/user-attachments/assets/3784f790-c926-48ad-beb9-bf4fda98e03e" />
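
For anyone trying to narrow this down further, here is a minimal sketch of how one might check which operand of that expression first contains non-finite values. The helper `report_nonfinite` and the tensor shapes are hypothetical and only for illustration; in the real model the tensors would come from the forward pass, and one operand is deliberately poisoned here to simulate the failure.

```python
import torch


def report_nonfinite(**tensors):
    """Return the names of the tensors that contain NaN or Inf values."""
    bad = []
    for name, value in tensors.items():
        if not torch.isfinite(value).all():
            bad.append(name)
    return bad


# Hypothetical stand-ins for the tensors in the forward pass (shapes assumed).
t_emb = torch.randn(2, 8)
t_emb_2 = torch.randn(2, 8)
t_emb_2[0, 0] = float("nan")  # simulate a bad value in one operand
delta_t_abs = torch.rand(2)

# Same expression as in the issue, evaluated on the stand-in tensors.
combined = t_emb + t_emb_2 * delta_t_abs.unsqueeze(1)

bad_names = report_nonfinite(
    t_emb=t_emb,
    t_emb_2=t_emb_2,
    delta_t_abs=delta_t_abs,
    combined=combined,
)
print(bad_names)
```

Calling a check like this just before the addition would show whether the NaN is already present in `t_emb`, `t_emb_2`, or `delta_t_abs`, or only appears in the combined result.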
