
Max pool #163

Merged · 23 commits merged into Lightning-AI:main on Apr 12, 2024

Conversation

@jjsjann123 (Collaborator) commented on Apr 11, 2024:

What does this PR do?

Fixes #164.

We have restored Thunder's performance by having torchex run max_pool2d/3d as a single aten call, instead of the decomposed reference implementation that goes through convolution.
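
Conceptually, the change looks something like the following sketch (hypothetical helper names, not thunder's actual registration API):

```python
# A minimal sketch of the idea: route the max_pool2d symbol to the single
# fused torch/aten kernel instead of its convolution-based decomposition.
import torch


def torch_max_pool2d_impl(a, kernel_size, stride=None, padding=0,
                          dilation=1, ceil_mode=False):
    # One aten call; its backward is handled by aten's dedicated
    # max_pool2d backward kernel rather than being re-derived through
    # the decomposed primitives.
    return torch.nn.functional.max_pool2d(
        a, kernel_size, stride=stride, padding=padding,
        dilation=dilation, ceil_mode=ceil_mode,
    )


# Hypothetical executor hook: prefer this implementation whenever
# max_pool2d appears in a trace.
# torchex.register_implementation("max_pool2d", torch_max_pool2d_impl)
```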

A quick performance comparison is shown below.
Before the PR:

```
jit_model elapsed time:  0.015625953674316406
torch eager elapsed time:  0.0018506050109863281
```

After the PR:

```
jit_model elapsed time:  0.0022668838500976562
torch eager elapsed time:  0.001873016357421875
```
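
For reference, a timing comparison like the one above could be collected with a harness along these lines (the model, shapes, and script are assumptions, not the exact benchmark from this PR):

```python
import time

import torch
import thunder

# Assumed workload: a bare pooling layer and an input shape typical of
# vision models.
model = torch.nn.MaxPool2d(kernel_size=3, stride=2)
x = torch.randn(32, 64, 112, 112)

jit_model = thunder.jit(model)
jit_model(x)  # warm-up call so compilation cost is excluded from the timing

start = time.time()
jit_model(x)
print("jit_model elapsed time: ", time.time() - start)

start = time.time()
model(x)
print("torch eager elapsed time: ", time.time() - start)
```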

Note that this is only done for max_pool2d/3d: max_pool1d is implicitly differentiable in PyTorch, so there is no dedicated backward entry for it in aten.

@jjsjann123 marked this pull request as ready for review on April 12, 2024 at 05:35.
@jjsjann123 (Collaborator, Author) commented:

I'm only changing the executor implementation, so I don't think extra tests are needed beyond what's already in CI.

@tfogal (Collaborator) left a comment:

Having an explicit op instead of decomposing feels reasonable.

Do we want the subsymbols of the poolXd ops to be the original verbose decomposition? My gut says no, i.e. just about every backend would explicitly implement a pooling operator anyway. But I wanted to throw it out there.

In general I'd recommend more """doc comments""" on functions, but I won't withhold a +1 over that. A comment I'd like to see somewhere is something to the effect of "we tried decomposing this as conv + X + Y + Z, but it leads to really bad perf, and systems like nvFuser implement pooling directly anyway", i.e. explaining why this op exists and why the alternative isn't great.
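
For illustration, such a comment might read something like this (wording is a suggestion, not from the PR):

```python
# NOTE: max_pool2d/3d are executed as single aten calls on purpose. We
# tried decomposing pooling into conv + comparison + reduction
# primitives, but it leads to really bad perf, and systems like nvFuser
# implement pooling directly anyway.
```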

@jjsjann123 added the visionmodel label (issues related to supporting vision model) on Apr 12, 2024.
@t-vi enabled auto-merge (squash) on April 12, 2024 at 20:25.
@t-vi (Collaborator) left a comment:

Thank you @jjsjann123 @tfogal

@t-vi merged commit 709a062 into Lightning-AI:main on Apr 12, 2024.
39 checks passed
@IvanYashchuk removed their request for review on April 15, 2024 at 09:32.
@jjsjann123 deleted the max_pool branch on April 17, 2024 at 17:24.
Labels: visionmodel (issues related to supporting vision model)
Projects: None yet
Development: merging this pull request may close #164, "torchex running pooling without decomposition".
4 participants