Added support F.one_hot #128

Merged: 30 commits into Lightning-AI:main, Apr 7, 2024

Conversation

@shaharelys (Contributor):

Before submitting
  • Was this discussed/approved via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs? No, should I?
  • Did you write any new necessary tests?

What does this PR do?

Implements F.one_hot.

Fixes #64
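
For reference, a minimal example of the PyTorch behavior being implemented (output shown in comments):

    import torch
    import torch.nn.functional as F

    F.one_hot(torch.tensor([0, 2, 1]), num_classes=3)
    # tensor([[1, 0, 0],
    #         [0, 0, 1],
    #         [0, 1, 0]])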

PR review

This PR is open for review, though once again I'm not certain it's complete. I added comments under one_hot for the things I wasn't sure of. Tests have passed. The docs have not been updated (I had a look at the README but wasn't sure I understood it). Feedback is welcome!

Did you have fun?

I sure did! 🙃

shaharelys and others added 20 commits March 28, 2024 21:43
make = partial(make_tensor, device=device, dtype=torch.long, requires_grad=requires_grad)

test_shapes = [
(10,),
@mruberry (Collaborator) · Apr 3, 2024:

Let's add a tensor with no dimensions (its shape is ()) and a tensor with no elements (like (0, 512)), too
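
A sketch of how the shape list might be extended along these lines (the exact list in the PR is not reproduced here):

    test_shapes = [
        (),        # tensor with no dimensions (a scalar)
        (0, 512),  # tensor with no elements
        (10,),
    ]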

@nikitaved (Contributor) · Apr 4, 2024:

Yeah, empty tensors + 0-dim tensors are very important. The underlying ops dispatch to PyTorch native implementations (scatter_add), but these were written by me before the widespread adoption of 0-dim tensors... So we'd better double-check that PyTorch does the right thing here as well. And if PyTorch falls short, we could file an issue there and short-circuit our implementation here.
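
As a quick sanity check of the native behavior (a sketch, not from the thread; worth verifying as suggested above), PyTorch's one_hot appears to handle both cases when num_classes is given explicitly:

    import torch
    import torch.nn.functional as F

    F.one_hot(torch.tensor(2), num_classes=4).shape
    # torch.Size([4])
    F.one_hot(torch.zeros(0, 512, dtype=torch.long), num_classes=4).shape
    # torch.Size([0, 512, 4])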

Contributor:

We are still missing empty inputs. Or is there an error?

@shaharelys (Author) · Apr 8, 2024:

@nikitaved I've added an empty input (0, 512) as you guys suggested (:

@nikitaved (Contributor) · Apr 8, 2024:

@shaharelys, what about scalar inputs, i.e. inputs with no dimensions (shape=())?

@shaharelys (Author):

Hey @nikitaved! Sorry for the delay. I did not add these. Should we also add these now?

@mruberry requested a review from @nikitaved on April 3, 2024, 18:20
@mruberry (Collaborator) commented on Apr 3, 2024:

Hey @shaharelys! This looks pretty good. I made some comments for your review. I also added @nikitaved as a reviewer.

@shaharelys (Author) commented on Apr 3, 2024:

@mruberry
Thx a lot! Will look into these 🙏🏼

Comment on lines 3502 to 3504
src = ones_like(index, dtype=dtypes.int64)

return scatter_add(canvas, dim=-1, index=index, src=src)
@nikitaved (Contributor) · Apr 4, 2024:

Seems fine for now, but what happens is that we create a tensor full of ones, because tensor creation and scatter_add are not going to be fused together. We could:

  • create a tensor with a single 1 and broadcast it, or
  • leave it as is, in the hope that at some point our backends/executors will improve this bit (when scatter_add becomes available in nvFuser, for example). Worth adding a comment, I guess?

PyTorch uses scatter, which apparently allows scalars as source inputs...
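
For illustration, a minimal PyTorch sketch of that scatter-with-a-scalar-source pattern (not code from this PR):

    import torch

    idx = torch.tensor([0, 2, 1])
    canvas = torch.zeros(3, 3, dtype=torch.long)
    canvas.scatter_(-1, idx.unsqueeze(-1), 1)  # scalar source: no tensor of ones is materialized
    # tensor([[1, 0, 0],
    #         [0, 0, 1],
    #         [0, 1, 0]])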

@shaharelys (Author) · Apr 6, 2024:

Hey @nikitaved! Not sure I understand the implementation suggested in "...and broadcast it". I tried a naive implementation by replacing,
src = ones_like(index, device=a.device, dtype=dtypes.int64)
with,
src = tensor([1], device=a.device, dtype=dtypes.int64)
and got,

thunder/core/prims.py:3009: in scatter_add_meta
    utils.check(
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

cond = False, s = <function scatter_add_meta.<locals>.<lambda> at 0x7bb2a8c043a0>
exception_type = <class 'RuntimeError'>

    def check(cond: bool, s: Callable[[], str], exception_type: type[Exception] = RuntimeError) -> None:
        """Helper function for raising an error_type (default: RuntimeError) if a boolean condition fails.
    
        s is a callable producing a string to avoid string construction if the error check is passed.
        """
        if not cond:
>           raise exception_type(s())
E           RuntimeError: Expected index (rank=3) to have the same rank as value (rank=1)

thunder/core/baseutils.py:103: RuntimeError

@nikitaved (Contributor) · Apr 8, 2024:

You can read about broadcasting here: https://numpy.org/doc/stable/user/basics.broadcasting.html. The idea is to create a tensor holding a single 1 and reshape/broadcast it into a shape that, in this case, should match either the index or the source, as I reckon. This solution should be future-proof and optimal, but it could be done as a follow-up.
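
A rough sketch of the idea (hypothetical code: the helper names full and expand are assumptions here and may not match thunder's actual internal API):

    # create a single 1 with the same rank as index, then expand it to index's
    # shape; the expanded tensor is a broadcasted view, so no kernel has to
    # fill memory with ones
    src = full((1,) * index.ndim, 1, device=a.device, dtype=dtypes.int64)
    src = expand(src, index.shape)
    return scatter_add(canvas, dim=-1, index=index, src=src)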

@shaharelys (Author) · Apr 8, 2024:

@nikitaved Cool, and should this be a more efficient operation than the current implementation?

@nikitaved (Contributor) · Apr 8, 2024:

Well, yes. It will spare us launching a kernel that fills memory with 1s. This might become redundant once the target executor can fuse ones and scatter_add together; in the context of nvFuser, this means putting the code of ones and scatter_add into a single CUDA kernel. Do not worry for now if you do not understand these things, and the change is not that critical right now. You can learn more about executors from the documentation, which, alas, has to be built locally.

@shaharelys (Author):

Hey @mruberry, @nikitaved ! I've reviewed and addressed most of the comments. However, there are a few points I'm uncertain about. I'll comment directly under those for clarity.

Looking forward to your feedback!

@t-vi enabled auto-merge (squash) on April 7, 2024, 12:26
@t-vi (Collaborator) left a comment.

@t-vi merged commit cd80d08 into Lightning-AI:main on Apr 7, 2024
@nikitaved (Contributor) commented on Apr 7, 2024:

@t-vi , I have not checked the changes yet, nor have I answered the posited questions :)
