Lookaside for torch.ops.higher_order.autograd_function_apply #1256
Conversation
I want the lookasides' scope to be limited to the preprocessing of PyTorch code. If the removed code is reused in the updated lookaside, we'll achieve that.
thunder/tests/test_jit_general.py (Outdated)
@@ -1231,6 +1233,9 @@ def my_sin(x):
torch.testing.assert_close(y, y_ref)

initial_computation_trace = thunder.last_traces(jitted)[0]
bsym_str_ids = tuple(
initial_computation_trace is not a valid Python function:
import thunder.torch as ltorch
import torch
from thunder.executors.torchex import no_autocast
@torch.no_grad()
@no_autocast
def computation(x):
# x: "cpu f32[2, 2]"
# /home/tv/data/firma/grid/thunder/lightning-thunder/thunder/tests/test_jit_general.py:1216: return grad_output * torch.cos(x)
t6 = ltorch.autograd_function_apply(_function_0, _function_1, x, args_tensor_mask=[True], non_differentiable_idx=[]) # t6: "cpu f32[2, 2]"
# t6 = ltorch.sin(x) # t6: "cpu f32[2, 2]"
# t6 = prims.sin(x) # t6: "cpu f32[2, 2]"
return t6
Is it a contract that any of the traces generated from a callable is executable?
What part of the provided code snippet makes it an invalid Python function?
Is it a contract that any of the traces generated from a callable is executable?
This has been the case up to now, and it is why I have repeatedly said that I want things properly inlined.
Could you please elaborate on what part of the example trace makes it not properly executable?
A valid Python program is generated from a trace using its string representation (trace.python()) and its "context" (trace.python_ctx). The context is passed as the globals= argument to the built-in exec function (https://docs.python.org/3/library/functions.html#exec).
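For illustration, a minimal sketch of that round trip, assuming the TraceCtx surface described above (trace.python() returning the source string and trace.python_ctx() returning a dict of referenced names); the wrapper function fn and the rest of the scaffolding are hypothetical:

import torch
import thunder

def fn(x):
    return torch.sin(x)

jitted = thunder.jit(fn)
x = torch.randn(2, 2)
jitted(x)  # run once so the traces are populated

trace = thunder.last_traces(jitted)[0]
source = trace.python()    # the trace's source code as a string
ctx = trace.python_ctx()   # assumption: a dict mapping names used by the source to objects

namespace = dict(ctx)
exec(compile(source, "<trace>", "exec"), namespace)  # the context plays the role of globals=
computation = namespace["computation"]
torch.testing.assert_close(computation(x), torch.sin(x))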
I pasted the two traces of initial_computation_trace = thunder.last_traces(jitted)[0] below.
The top is this PR, the bottom, main.
I'm not seeing the difference.
import thunder
import thunder.torch as ltorch
import torch
from thunder.executors.torchex import no_autocast
@torch.no_grad()
@no_autocast
def computation(x):
# x: "cpu f32[2, 2]"
# /home/mkozuki/ghq/github.com/crcrpar/lightning-thunder/thunder/tests/test_jit_general.py:1218: return grad_output * x.cos()
t6 = ltorch.autograd_function_apply(_function_0, _function_1, x, args_tensor_mask=[True], non_differentiable_idx=[]) # t6: "cpu f32[2, 2]"
# t6 = ltorch.sin(x) # t6: "cpu f32[2, 2]"
# t6 = prims.sin(x) # t6: "cpu f32[2, 2]"
return t6
import thunder
import thunder.torch as ltorch
import torch
from thunder.executors.torchex import no_autocast
@torch.no_grad()
@no_autocast
def computation(x):
# x: "cpu f32[2, 2]"
# /home/mkozuki/ghq/github.com/crcrpar/lightning-thunder/thunder/tests/test_jit_general.py:1217: return torch.ops.higher_order.autograd_function_apply(
t0 = ltorch.autograd_function_apply(_function_0, _function_1, x, args_tensor_mask=[True], non_differentiable_idx=[]) # t0: "cpu f32[2, 2]"
# t0 = ltorch.sin(x) # t0: "cpu f32[2, 2]"
# t0 = prims.sin(x) # t0: "cpu f32[2, 2]"
return t0
Masaki offered #1526 as an alternative without higher-order functions. I merged it to unblock uses of autograd_function_apply.
What does this PR do?
As per #1248, support for torch.ops.higher_order.autograd_function_apply becomes a bit more flexible by tracing into both fwd and bwd.
cc @apaz-cli
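For context, a minimal sketch of the kind of user code this lookaside targets. The fwd/bwd helpers and my_sin are illustrative, modeled on the sin/cos trace shown above and on autograd_function_apply's convention that fwd returns (output, saved_values) and bwd receives the gradients followed by the saved values; this is not the exact test added in the PR:

import torch
import thunder

def fwd(ctx, x):
    # returns (output, saved_values)
    return torch.sin(x), (x,)

def bwd(ctx, grad_output, x):
    return grad_output * torch.cos(x)

def my_sin(x):
    return torch.ops.higher_order.autograd_function_apply(
        fwd, bwd, x, args_tensor_mask=[True], non_differentiable_idx=[]
    )

jitted = thunder.jit(my_sin)
x = torch.randn(2, 2, requires_grad=True)
y = jitted(x)
torch.testing.assert_close(y, torch.sin(x))
# gradients should flow through bwd once the lookaside traces into it
y.backward(torch.ones_like(y))
torch.testing.assert_close(x.grad, torch.cos(x.detach()))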