Logging gradient norms before clipping #1026
Is there an easy way to log the norm of the gradients before clipping, when using a setup like the following?

```python
grad_accum_steps = 4
optim = optax.chain(optax.adamw(1e-5, mask=mask), optax.clip_by_global_norm(1.0))
optim = optax.MultiSteps(optim, every_k_schedule=grad_accum_steps)
```
You may simply create a custom GradientTransformation that does not touch the updates; it just computes the norm and puts it in the state (or even prints it if you want). Then you build the usual chain, except that you insert that custom transform just before the clipping.
You can then fetch the gradient norm from the overall state using optax.tree_utils.tree_get.
Something along the following lines:
```python
import typing
import jax
import jax.numpy as jnp
import optax

class RecordNormState(typing.NamedTuple):
  grad_norm: jax.Array

def record_norm():
  # Passes updates through unchanged and only records their global L2 norm.
  def init_fn(params):
    return RecordNormState(grad_norm=jnp.asarray(0.0))
  def update_fn(updates, state, params=None):
    return updates, RecordNormState(grad_norm=optax.tree_utils.tree_l2_norm(updates))
  return optax.GradientTransformation(init_fn, update_fn)
```
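For completeness, here is a minimal usage sketch of how this could plug into the chain from your question (assuming `mask`, `params`, and `grads` are defined as in your training setup, and that your optax version provides `optax.tree_utils.tree_get`):

```python
grad_accum_steps = 4
optim = optax.chain(
    optax.adamw(1e-5, mask=mask),
    record_norm(),  # inserted just before the clipping
    optax.clip_by_global_norm(1.0),
)
optim = optax.MultiSteps(optim, every_k_schedule=grad_accum_steps)

opt_state = optim.init(params)
updates, opt_state = optim.update(grads, opt_state, params)
# tree_get searches the nested optimizer state for the field named "grad_norm".
grad_norm = optax.tree_utils.tree_get(opt_state, "grad_norm")
```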