-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BFS traversal could not visit some nodes while fusing take_along_axis
#3718
Comments
This one is because
When a fusion has slice/pad, we automatically switch to using the IdModel-based indexer, but that doesn't yet support ops like takeAlongAxis. That would be something we would need to work on this Q anyway, but for now, a quick fix would be to disallow fusing those ops together. I'll create a patch. |
@naoyam Out of curiosity, does this mean that with the patch, the region will be split in two fusions? |
Unfortunately, yes at this moment. This is a patch to make it run. Need more work for performance, which is part of our Q1 plans. |
|
While investigating #1552 in Thunder, I encountered the following error:
with repro:
ps. The repro seems to be working on H100 but only with flag
CUDA_LAUNCH_BLOCKING=1
, without it there is an IMAThe text was updated successfully, but these errors were encountered: