Skip to content

What means the step in the update function? #5972

Answered by hawkinsp
jingtianweidizzf asked this question in Q&A
Discussion options

You must be logged in to vote

I'd suggest reading the code of the optimizers module: it's pretty readable on the whole.

Step numbers are used for at least two reasons:

  1. for learning rate schedules, where parameters like the learning rate are varied as training progresses. For example, if you look at the implementation of sgd (https://cs.opensource.google/jax/jax/+/master:jax/experimental/optimizers.py;drc=b260468b51efff40183796e98b844314b66f7686;l=246 ) it computes a step_size based on the current step number.
  2. some optimizers use the step number explicitly. For example, the ADAM optimizer uses it in its bias correction term (https://cs.opensource.google/jax/jax/+/master:jax/experimental/optimizers.py;drc=b260468b51eff…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@jingtianweidizzf
Comment options

Answer selected by jingtianweidizzf
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants