
support inferring dtype with torch.get_default_dtype for factory functions #775


Merged: 3 commits into Lightning-AI:main on Jul 16, 2024

Conversation

@kshitij12345 (Collaborator) commented on Jul 16, 2024

Fixes: #750

Changes -

  1. Stash torch.get_default_dtype in cache_info. This also adds a check to the prologue trace to verify that the jitted function is called with the same default dtype (see the example prologue below).
  2. Factory functions infer the dtype from the torch.get_default_dtype value stashed in cache_info (if dtype is not passed explicitly).
  3. We don't support changing the default dtype inside the jitted function, as reordering and fusion can lead to issues; it is a loud error for now (we can revisit in a follow-up if required). See the sketch after this list.
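
As a rough illustration of point 3, a minimal sketch of the unsupported pattern (the exact exception type and message are assumptions, not taken from this PR):

import torch
import thunder

def bar(x: torch.Tensor) -> torch.Tensor:
    # Mutating the global default dtype inside a jitted function is rejected.
    torch.set_default_dtype(torch.float64)
    return torch.zeros(x.shape)

jbar = thunder.jit(bar)
jbar(torch.randn(3, 3))  # expected to fail loudly rather than trace through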

Repro:

import torch
import thunder

def foo(x: torch.Tensor) -> torch.Tensor:
    o = torch.zeros(x.shape, device=x.device)
    return o

jfoo = thunder.jit(foo)
o = jfoo(torch.randn(3, 3))
print(o.dtype)
print(thunder.last_prologue_traces(jfoo)[0])
print()
print(thunder.last_traces(jfoo)[0])
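
With the process default dtype left at torch.float32, the first print emits torch.float32, matching the f32 annotations in the traces below.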

Prologue Trace

import thunder
import thunder.core.prims as prims
import torch
from thunder.executors.torchex import no_autocast

@torch.no_grad()
@no_autocast
def prologue(*args, **kwargs):
  # args: "Any"
  prims.check_len(args, 1)
  # kwargs: "Any"
  prims.check_len(kwargs, 0)
  t_0: "cpu f32[3, 3]" = args[0]
  prims.check_tensor_metadata(t_0, (3, 3), 'cpu', torch.float32, False)
  cache_info: "Any" = thunder._get_cache_info()
  cache_info_default_dtype: "<class 'torch.dtype'>" = cache_info['default_dtype']
  # NOTE - We bake the torch.dtype in trace (check_tensor_metadata also does the same).
  prims.check_literal_like(cache_info_default_dtype, torch.float32)
  cache_info_is_autocast_enabled: "bool False" = cache_info['is_autocast_enabled']
  prims.check_number_type_and_value(cache_info_is_autocast_enabled, False)
  cache_info_no_grad_sync: "bool False" = cache_info['no_grad_sync']
  prims.check_number_type_and_value(cache_info_no_grad_sync, False)
  return ((), ())
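
Because the prologue bakes the default dtype into a prims.check_literal_like check, calling jfoo again after changing the default dtype should fail that check rather than silently reuse the f32 trace. A hedged sketch of that interaction (the re-trace-on-mismatch behavior is inferred from the check above, not stated in this PR):

import torch
import thunder

def foo(x: torch.Tensor) -> torch.Tensor:
    return torch.zeros(x.shape, device=x.device)

jfoo = thunder.jit(foo)
print(jfoo(torch.randn(3, 3)).dtype)  # torch.float32 under the f32 default

# The prologue's check_literal_like(cache_info_default_dtype, torch.float32)
# no longer holds after this, so the cached trace should not be reused as-is.
torch.set_default_dtype(torch.float64)
x = torch.randn(3, 3, dtype=torch.float32)  # keep the input's dtype fixed
print(jfoo(x).dtype)  # torch.float64 (assumed: re-traced under the new default)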

Computation Trace

import thunder
import thunder.core.devices as devices
import thunder.torch as ltorch
import torch
from thunder.executors.torchex import no_autocast

@torch.no_grad()
@no_autocast
def computation():
  # /home/kkalambarkar/lightning-thunder/scratchpad/test.py:64:             o = torch.zeros(x.shape, device=x.device)
  o = ltorch.zeros((3, 3), device=devices.Device("cpu"), dtype=None)  # o: "cpu f32[3, 3]"
    # o = ltorch.full((3, 3), 0, device=devices.Device("cpu"), dtype=None)  # o: "cpu f32[3, 3]"
      # o = prims.full((3, 3), 0, device=devices.Device("cpu"), dtype=dtypes.float32)  # o: "cpu f32[3, 3]"
  return o
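
Note how ltorch.zeros is recorded with dtype=None while the underlying prims.full call carries dtype=dtypes.float32: the default dtype is resolved during tracing. A hypothetical sketch of that resolution step (the helper name and signature are illustrative, not thunder's actual internals):

def _resolve_factory_dtype(dtype, cache_info):
    # An explicit dtype argument wins; otherwise fall back to the default
    # dtype stashed in cache_info when the function was jitted.
    if dtype is not None:
        return dtype
    return cache_info["default_dtype"]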

@kshitij12345 marked this pull request as ready for review on July 16, 2024 at 12:34

@t-vi (Collaborator) left a comment


Supergood, thank you @kshitij12345

@t-vi merged commit 6703b35 into Lightning-AI:main on Jul 16, 2024
39 checks passed
@kshitij12345 changed the title from "[WIP] support inferring dtype with torch.get_default_dtype for factory functions" to "support inferring dtype with torch.get_default_dtype for factory functions" on Jul 16, 2024
@github-actions bot deleted the support-torch-default-dtype branch on October 16, 2024 at 00:46
Development

Successfully merging this pull request may close this issue:

type inference: mismatched dtype in cat operator (#750)