Simplify function implementation returned by thunder.jit for easier instrumentation of different stages #1333

IvanYashchuk · 2024-10-21T11:36:04Z

This PR changes the thunder.jit.fn_ implementation to use just a few lines of code making it more readable and amenable to per-function instrumentation like cProfile, pyinstrument, or NVTX decorators.

I removed a few unused timer measurements or if there was just a stop with no start. Other than that all CompileStats time measurements are preserved in this PR.

There was a todo to rename "last_executed" to "last_computation", this change is included here.

Here is what the function looks like now:

def fn_(*args, **kwargs) -> Any:
    cache_entry, inps, pro_to_epi = get_computation_and_inputs(*args, **kwargs)
    check_storage_aliases(cache_entry, inps)
    result = cache_entry.computation_fn(*inps)
    result = maybe_connect_to_autograd(cache_entry, result)
    result = maybe_call_epilogue(cache_entry, result, pro_to_epi)
    return result

for more information, see https://pre-commit.ci

IvanYashchuk · 2024-10-21T15:32:00Z

@tfogal, could you please review the changes? Would this PR work well with the planned nvtx marker decorators from #1268?

IvanYashchuk · 2024-10-21T15:55:57Z

CI failures that I need to take a look what's going on:

=========================== short test summary info ============================
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-llama1-like] - TypeError: tuple indices must be integers or slices, not tuple
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-codellama2-like] - TypeError: tuple indices must be integers or slices, not tuple
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-falcon-40b-like] - TypeError: tuple indices must be integers or slices, not tuple
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-long-context-like] - TypeError: tuple indices must be integers or slices, not tuple
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-llama2-like] - TypeError: tuple indices must be integers or slices, not tuple
FAILED thunder/tests/test_jit_general.py::test_litgpt_variants_kvcache[cpu-falcon-7b-like] - TypeError: tuple indices must be integers or slices, not tuple

tfogal

Thank you!

thunder/__init__.py

tfogal · 2024-10-21T17:47:30Z

There was a todo to rename "last_executed" to "last_computation", this change is included here.

Is this one of the timers that mixology is using? If so, we may want to hold off on this part. @mpatel31415 ?

mpatel31415 · 2024-10-22T06:51:26Z

There was a todo to rename "last_executed" to "last_computation", this change is included here.

Is this one of the timers that mixology is using? If so, we may want to hold off on this part. @mpatel31415 ?

No, we use iter_t0 = time.perf_counter() and t1 = time.perf_counter() from benchmark_litgpt.py script :)

tfogal · 2024-10-22T16:45:54Z

There was a todo to rename "last_executed" to "last_computation", this change is included here.

Is this one of the timers that mixology is using? If so, we may want to hold off on this part. @mpatel31415 ?

No, we use iter_t0 = time.perf_counter() and t1 = time.perf_counter() from benchmark_litgpt.py script :)

oh, great!

I thought you were using some timers, though---how are you measuring compilation time, if not using these CompileStats timers?

mpatel31415 · 2024-10-23T14:19:36Z

There was a todo to rename "last_executed" to "last_computation", this change is included here.

Is this one of the timers that mixology is using? If so, we may want to hold off on this part. @mpatel31415 ?

No, we use iter_t0 = time.perf_counter() and t1 = time.perf_counter() from benchmark_litgpt.py script :)

oh, great!

I thought you were using some timers, though---how are you measuring compilation time, if not using these CompileStats timers?

We measure 2N iterations giving us iter_times = [t1, t2, .. t_2N] and we assume that compilation (and warmup) time is sum(iter_times[:N]) - sum(iter_times[N:]). In this way it's easy to measure it irrespective of the compilation method in the same way.

tfogal · 2024-10-23T16:17:18Z

I thought you were using some timers, though---how are you measuring compilation time [. . .]

We measure 2N iterations [. . .]

Oh! That's actually great news as I had (incorrectly) thought you were using the timers, and this makes it easier for us to change that interface.

Thanks!

tfogal · 2024-10-30T23:16:53Z

Looks like tests pass and this is good to go, AFAICT.

@t-vi merge?

t-vi · 2024-10-31T08:44:21Z

So when will we be able to drop the timers?

t-vi

Thank you @IvanYashchuk @mpatel31415 @tfogal
Seems good, I would want to drop all the timings gathered here very soon unless there is a user for them.
From what I understand, people interested in timings do profile things.

tfogal · 2024-11-01T05:00:02Z

So when will we be able to drop the timers?

I can start looking at this after #1268

IvanYashchuk added 12 commits October 21, 2024 13:12

Move last_trace_host and calls recording to a decorator

4d15278

Remove last_trace_cache_stop there's no corresponding start

a36f14b

Rename last_executed -> last_computation

09034c1

Remove last_computation_execution_stop there's no corresponding start

288e840

Move storage check for aliases to a separate function

3495bfe

Move ThunderFunction.apply to a separate function

2b089d9

Move epilogue_fn invocation to a separate function

90f9f2a

Remove unused last_trace_host_tracing measurements

63a6064

Move host_execution timer to a decorator to be applied on computation_fn

fabfc41

Remove a couple of empty lines

70ac9a7

Add prologue_execution_timer decorator

d89f51d

decorate_computation_functions -> decorate_computation_function

77c7d95

IvanYashchuk added the jit label Oct 21, 2024

IvanYashchuk requested review from mruberry, lantiga and t-vi as code owners October 21, 2024 11:36

[pre-commit.ci] auto fixes from pre-commit.com hooks

a8c4d7a

for more information, see https://pre-commit.ci

IvanYashchuk marked this pull request as draft October 21, 2024 15:09

CacheEntry is a named tuple which doesn't support attribute assignment

a9fd993

IvanYashchuk marked this pull request as ready for review October 21, 2024 15:30

IvanYashchuk requested a review from tfogal October 21, 2024 15:30

maybe_call_epilogue should return result

9923a42

tfogal approved these changes Oct 21, 2024

View reviewed changes

thunder/__init__.py Show resolved Hide resolved

tfogal mentioned this pull request Oct 28, 2024

Add an nvtx decorator for annotating components #1268

Merged

IvanYashchuk added 2 commits October 29, 2024 09:46

Merge branch 'main' into thunder-fn-instrumentation

4ddfff5

Merge branch 'main' into thunder-fn-instrumentation

511f39a

t-vi approved these changes Oct 31, 2024

View reviewed changes

t-vi merged commit 7b620b0 into main Oct 31, 2024
41 checks passed

t-vi deleted the thunder-fn-instrumentation branch October 31, 2024 08:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Simplify function implementation returned by thunder.jit for easier instrumentation of different stages #1333

Simplify function implementation returned by thunder.jit for easier instrumentation of different stages #1333

Uh oh!

IvanYashchuk commented Oct 21, 2024 •

edited

Loading

Uh oh!

IvanYashchuk commented Oct 21, 2024

Uh oh!

IvanYashchuk commented Oct 21, 2024

Uh oh!

tfogal left a comment

Uh oh!

Uh oh!

tfogal commented Oct 21, 2024 •

edited

Loading

Uh oh!

mpatel31415 commented Oct 22, 2024

Uh oh!

tfogal commented Oct 22, 2024 •

edited

Loading

Uh oh!

mpatel31415 commented Oct 23, 2024

Uh oh!

tfogal commented Oct 23, 2024

Uh oh!

tfogal commented Oct 30, 2024

Uh oh!

t-vi commented Oct 31, 2024

Uh oh!

t-vi left a comment

Uh oh!

Uh oh!

tfogal commented Nov 1, 2024

Uh oh!

Uh oh!

Simplify function implementation returned by thunder.jit for easier instrumentation of different stages #1333

Simplify function implementation returned by thunder.jit for easier instrumentation of different stages #1333

Uh oh!

Conversation

IvanYashchuk commented Oct 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IvanYashchuk commented Oct 21, 2024

Uh oh!

IvanYashchuk commented Oct 21, 2024

Uh oh!

tfogal left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tfogal commented Oct 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mpatel31415 commented Oct 22, 2024

Uh oh!

tfogal commented Oct 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mpatel31415 commented Oct 23, 2024

Uh oh!

tfogal commented Oct 23, 2024

Uh oh!

tfogal commented Oct 30, 2024

Uh oh!

t-vi commented Oct 31, 2024

Uh oh!

t-vi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tfogal commented Nov 1, 2024

Uh oh!

Uh oh!

IvanYashchuk commented Oct 21, 2024 •

edited

Loading

tfogal commented Oct 21, 2024 •

edited

Loading

tfogal commented Oct 22, 2024 •

edited

Loading