
About the learning rate setting of p_conv and m_conv #7

Open
dontLoveBugs opened this issue Feb 21, 2019 · 7 comments

@dontLoveBugs

dontLoveBugs commented Feb 21, 2019

You set the gradients of p_conv and m_conv to 0.1 times those of the other layers, but I find the gradients are unchanged after backward.
I used the following code to test:

    import torch
    import torch.nn as nn

    from deform_conv_v2 import DeformConv2d  # the module from this repo (adjust the import path if needed)

    # _set_lr is the backward hook defined in DeformConv2d, copied here for reference
    def _set_lr(module, grad_input, grad_output):
        print('grad input:', grad_input)
        print('grad output:', grad_output)
        grad_input = (grad_input[i] * 0.1 for i in range(len(grad_input)))
        grad_output = (grad_output[i] * 0.1 for i in range(len(grad_output)))

    x = torch.randn(4, 3, 5, 5)
    y_ = torch.randn(4, 1, 5, 5)
    loss = nn.L1Loss()

    d_conv = DeformConv2d(inc=3, outc=1, modulation=True)

    y = d_conv(x)  # call the module (not .forward()) so that registered hooks run
    l = loss(y, y_)
    l.backward()

    print('p conv grad:')
    print(d_conv.p_conv.weight.grad)
    print('m conv grad:')
    print(d_conv.m_conv.weight.grad)
    print('conv grad:')
    print(d_conv.conv.weight.grad)

The gradient of p_conv is the same as grad_input, but I think the gradient of p_conv should be 0.1 times grad_input. Am I wrong?
[screenshots: the printed gradients of p_conv and conv]
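
For reference, a minimal sketch that reproduces the observation, using a plain nn.Conv2d as a stand-in for DeformConv2d (the layer and shapes here are illustrative, not the repo's): a backward hook that only rebinds its local grad_input/grad_output names and returns nothing leaves the gradients untouched.

    import torch
    import torch.nn as nn

    def _set_lr(module, grad_input, grad_output):
        # rebinding the local names has no effect outside the hook;
        # register_backward_hook only replaces grad_input with a value the hook *returns*
        # (note: register_backward_hook is deprecated in newer PyTorch in favor of register_full_backward_hook)
        grad_input = (grad_input[i] * 0.1 for i in range(len(grad_input)))
        grad_output = (grad_output[i] * 0.1 for i in range(len(grad_output)))

    conv = nn.Conv2d(3, 1, 3, padding=1)
    conv.register_backward_hook(_set_lr)

    x = torch.randn(4, 3, 5, 5)
    nn.L1Loss()(conv(x), torch.randn(4, 1, 5, 5)).backward()
    print(conv.weight.grad)  # same values as without the hook, i.e. not scaled by 0.1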

@4uiiurz1 (Owner)

You're right!
I'll fix it.

@BananaLv26

> You're right!
> I'll fix it.

Have you solved this problem now?

@jszgz

jszgz commented May 28, 2020

@dontLoveBugs Hello, can you review my issue? I think the bilinear kernel is wrong.

@zcong17huang

> You're right!
> I'll fix it.

A 'tuple' object cannot be modified in place. Your code just creates a generator.

@XinZhangRadar

I have searched online: grad_output cannot be modified, but if you want to modify grad_input you need to return the modified grad_input from the hook, like:

    def _set_lr(module, grad_input, grad_output):
        return (grad_input[i] * 0.1 for i in range(len(grad_input)))

You can try it. My question is: why change the p_conv gradients? Is it to avoid affecting the learning of the other feature extraction branch?

@steven22tom

@XinZhangNLPR that is because the backward hook expects a tuple, not a 'generator'.

> I have searched online: grad_output cannot be modified, but if you want to modify grad_input you need to return the modified grad_input from the hook, like:
>
>     def _set_lr(module, grad_input, grad_output):
>         return (grad_input[i] * 0.1 for i in range(len(grad_input)))
>
> You can try it. My question is: why change the p_conv gradients? Is it to avoid affecting the learning of the other feature extraction branch?

Your suggestion still returns a generator, not a tuple.
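
Putting those two points together, a sketch of a hook that actually rescales the gradients would need to return a real tuple (and skip None entries); whether the repository fixes it exactly this way is not confirmed here:

    def _set_lr(module, grad_input, grad_output):
        # must return a tuple: the returned value replaces grad_input during backward
        return tuple(g * 0.1 if g is not None else None for g in grad_input)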

@YXB-NKU

YXB-NKU commented Oct 3, 2023

> You're right! I'll fix it.

It seems this bug has not been fixed yet.
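
A workaround that avoids the hook machinery altogether is to express the intended 0.1x factor through optimizer parameter groups; a rough sketch, assuming d_conv is the DeformConv2d instance from the test code above and base_lr is illustrative:

    import torch

    base_lr = 1e-3
    optimizer = torch.optim.SGD([
        {'params': d_conv.conv.parameters()},                         # base learning rate
        {'params': d_conv.p_conv.parameters(), 'lr': base_lr * 0.1},  # offset branch, 10x smaller
        {'params': d_conv.m_conv.parameters(), 'lr': base_lr * 0.1},  # modulation branch, 10x smaller
    ], lr=base_lr)

For plain SGD this is equivalent to scaling the gradients by 0.1; for adaptive optimizers such as Adam the two are not exactly the same, since the gradient scale is partly normalized away.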
