Hello,
Thanks for implementing SNIP in PyTorch, but I believe there is something wrong in the code.
In the original paper and implementation, the loss is differentiated with respect to the connection indicators, not the weights.
From Lee's original code, line 67:
grads = tf.gradients(loss, [mask_init[k] for k in prn_keys])
I understand you have
weight = indicator * weight
before computing gradients, but I can't see where you extract the gradients for the indicators only. I see you've posted on the PyTorch forum about this, but nobody has answered properly.
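For reference, here is a minimal sketch of what I mean in PyTorch. The two-layer model, the tensor names and the 10% keep ratio are made up purely for illustration; the point is only that the indicators are the sole tensors requiring gradients, so torch.autograd.grad(loss, [c1, c2]) is the direct analogue of Lee's tf.gradients(loss, mask_init).

```python
import torch
import torch.nn.functional as F

# Hypothetical two-layer model and random data, only to show the gradient flow.
torch.manual_seed(0)
w1 = torch.randn(50, 20) * 0.1            # layer weights, treated as constants here
w2 = torch.randn(10, 50) * 0.1
inputs = torch.randn(64, 20)
targets = torch.randint(0, 10, (64,))

# Connection indicators, initialised to 1; the only leaves that require grad.
c1 = torch.ones_like(w1, requires_grad=True)
c2 = torch.ones_like(w2, requires_grad=True)

# Forward pass with weight = indicator * weight.
hidden = torch.relu(F.linear(inputs, c1 * w1))
logits = F.linear(hidden, c2 * w2)
loss = F.cross_entropy(logits, targets)

# Gradients w.r.t. the indicators -- the analogue of
# tf.gradients(loss, [mask_init[k] for k in prn_keys]).
g1, g2 = torch.autograd.grad(loss, [c1, c2])

# SNIP saliency: |dL/dc| normalised over all connections, keep the top-k.
norm = g1.abs().sum() + g2.abs().sum()
all_scores = torch.cat([g1.abs().flatten(), g2.abs().flatten()]) / norm
num_keep = int(0.1 * all_scores.numel())  # e.g. prune to 10% density
threshold = torch.topk(all_scores, num_keep).values.min()
keep1 = (g1.abs() / norm) >= threshold
keep2 = (g2.abs() / norm) >= threshold
print(f"kept {int(keep1.sum() + keep2.sum())} of {all_scores.numel()} connections")
```

With this setup, g1 and g2 carry the per-connection sensitivities while the weight values themselves never receive a gradient, which is the behaviour described in the paper.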
iamanigeeit changed the title from "SNIP differentiates on indicators/mask, not on weights" to "SNIP should differentiate on indicators/mask, not on weights" on Mar 15, 2022
iamanigeeit changed the title from "SNIP should differentiate on indicators/mask, not on weights" to "SNIP should calculate gradients on indicators/mask, not on weights" on Mar 15, 2022