### Bug description In short, DDP/HSDP and FSDP grad norms have a scaling difference of a factor of 8 for the current version of torchtitan. <img width="512" height="205" alt="Image" src="https://github.com/user-attachments/assets/f1aa6829-2458-47f2-95b2-8230bd0833e0" /> ### Versions latest torchtitan