refactor hardshrink_opinfo with singularity_fn_producer #1517

beverlylytle · 2024-12-05T09:32:46Z

Many linear activation functions of torch.nn.functional have partial derivatives with jump discontinuities at dynamically defined values, eg, hardshrink has a kwarg lambd which sets the relevant discontinuities at +/-lambd. The test test_vjp_correctness relies on using the technique of computing finite differences to approximate these partial derivatives to validate Thunder's computation of the partials. These finite differences behave badly around these discontinuities. Currently, each OpInfo allows the supplement of a singularity_fn to push test input values away from the discontinuities, but it only allows for a single singularity_fn, which cannot reflect the dynamic variation of the "bad" points. This PR introduces a singularity_fn_producer, which is a function mapping a SampleInput to a singularity_fn, allowing the singularity_fn to reflect the kwargs of the SampleInput.

mruberry · 2024-12-05T19:08:25Z

The test failure is unrelated to this PR, fyi @t-vi. It is

FAILED thunder/tests/test_grad.py::test_vjp_correctness_celu_torch_cpu_thunder.dtypes.float64 - AssertionError: Scalars are not close!

Expected 12.54497983051468 but got 12.544988595853251.
Absolute difference: 8.765338570526637e-06 (up to 1e-07 allowed)
Relative difference: 6.98712846807903e-07 (up to 1e-07 allowed)

which is tracked by #1514

mruberry

Cool!

This is a good generalization of singularity functions. It's nice that it doesn't require rewriting existing OpInfos that use singularity functions. In the future we may want to remove the singularity_fn option for OpInfos, since it's now just a sugar for specifying a singularity_fn_producer that ignores its inputs

refactor hardshrink_opinfo with singularity_fn_producer

a7f30a1

beverlylytle requested review from mruberry, lantiga and t-vi as code owners December 5, 2024 09:32

fix

0cc2cf4

mruberry approved these changes Dec 5, 2024

View reviewed changes

mruberry enabled auto-merge (squash) December 5, 2024 19:10

Merge branch 'main' into singularity_prod

c3965b8

mruberry merged commit 4410127 into main Dec 6, 2024
41 checks passed

mruberry deleted the singularity_prod branch December 6, 2024 09:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor hardshrink_opinfo with singularity_fn_producer #1517

refactor hardshrink_opinfo with singularity_fn_producer #1517

beverlylytle commented Dec 5, 2024

mruberry commented Dec 5, 2024

mruberry left a comment

refactor hardshrink_opinfo with singularity_fn_producer #1517

refactor hardshrink_opinfo with singularity_fn_producer #1517

Conversation

beverlylytle commented Dec 5, 2024

mruberry commented Dec 5, 2024

mruberry left a comment

Choose a reason for hiding this comment