gradient checks do not match #13

vuptran · 2017-07-08T10:01:37Z

I checked the gradients you derived against the numerical gradients, and your implementation does not match. It looks like the error is in two places:

In calculate_loss, you average the total loss (including the regularization term) over the data batch. The correct implementation should average only the log loss, but not the regularization term.
In build_model, the gradients (dW1, dW2, db1, db2) during backprop should be averaged over the data batch. Again, the correct implementation should not include the regularization terms in the average over the data batch.

The text was updated successfully, but these errors were encountered:

uripeled2 · 2020-05-11T09:43:35Z

Do you have or know a better implementation?
Can you explain or show to me how you checked it?

vuptran · 2020-05-11T15:55:38Z

@uripeled2 I have a method for gradient checking in my implementation here: https://github.com/vuptran/introduction-to-neural-networks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gradient checks do not match #13

gradient checks do not match #13

vuptran commented Jul 8, 2017

uripeled2 commented May 11, 2020

vuptran commented May 11, 2020

gradient checks do not match #13

gradient checks do not match #13

Comments

vuptran commented Jul 8, 2017

uripeled2 commented May 11, 2020

vuptran commented May 11, 2020