You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It should be because we are also not enabling CUDAGraphs with Thunder too ("reduce-overhead" simply enables CUDAGraphs). So this way is more of an apples-to-apples comparison. Maybe @parthmannan can correct me here
Either way, from my personal experience in running this, cudagraphs won't help during training (especially if your model is large enough) because you are not overhead bound at all
Was curious to know why the
reduce-overhead
option was not used as a baseline and if there's a comparison somewhere usingreduce-overhead
?Thanks !
cc @apaz-cli
The text was updated successfully, but these errors were encountered: