Add masked_index_benchmark #2989

Closed
sryap wants to merge 1 commit

sryap commented Aug 14, 2024

Summary:
This diff adds a benchmark that measures host-to-device copy
performance using `torch.ops.fbgemm.masked_index_put`. The host buffer
is a UVM buffer (by default, allocated with `malloc` and registered with `cudaHostRegister`).

Differential Revision: D61284671
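
For readers who want to interpret numbers from a benchmark like this, here is a minimal sketch of the timing and bandwidth arithmetic. It is not the benchmark added in this diff: `time_copy` and `copy_bandwidth_gb_per_s` are hypothetical helper names, and the sketch assumes the copy callable blocks until the copy has finished (on a GPU that would typically mean calling `torch.cuda.synchronize()` inside it).

```python
import time
from typing import Callable


def copy_bandwidth_gb_per_s(
    num_rows: int, row_dim: int, bytes_per_elem: int, elapsed_s: float
) -> float:
    """Effective bandwidth (GB/s, 1 GB = 1e9 bytes) for copying
    num_rows x row_dim elements in elapsed_s seconds."""
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    total_bytes = num_rows * row_dim * bytes_per_elem
    return total_bytes / elapsed_s / 1e9


def time_copy(copy_fn: Callable[[], None], warmup: int = 3, iters: int = 10) -> float:
    """Return mean wall-clock seconds per call of copy_fn.

    Assumption: copy_fn only returns once the copy is complete. For an
    asynchronous GPU copy the caller must synchronize inside copy_fn,
    otherwise only the launch overhead is measured.
    """
    for _ in range(warmup):  # warm-up calls are excluded from the timing
        copy_fn()
    start = time.perf_counter()
    for _ in range(iters):
        copy_fn()
    return (time.perf_counter() - start) / iters


if __name__ == "__main__":
    # Stand-in for the real copy: a host-side byte copy, used here only
    # so the sketch runs without a GPU or FBGEMM installed.
    rows, dim, elem_bytes = 10_000, 128, 4
    src = bytearray(rows * dim * elem_bytes)
    dst = bytearray(len(src))

    def host_copy() -> None:
        dst[:] = src

    secs = time_copy(host_copy)
    print(f"{copy_bandwidth_gb_per_s(rows, dim, elem_bytes, secs):.2f} GB/s")
```

In an actual host-to-device measurement, `host_copy` would be replaced by a callable that invokes the op under test and then synchronizes the device.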

netlify bot commented Aug 14, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 1f9bfb1
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66bfa9dc5c95ca0009f5ae8c
😎 Deploy Preview https://deploy-preview-2989--pytorch-fbgemm-docs.netlify.app

@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D61284671

sryap added a commit to sryap/FBGEMM that referenced this pull request Aug 14, 2024
Summary:
X-link: facebookresearch/FBGEMM#82

Pull Request resolved: pytorch#2989

This diff adds a benchmark for measuring host-to-device copy
performance using `torch.ops.fbgemm.masked_index_put`. The host buffer
is a UVM buffer (by default it is `malloc+cudaHostRegister`).

Differential Revision: D61284671
@sryap sryap force-pushed the export-D61284671 branch from 45ddc7e to b8422e8 Compare August 14, 2024 23:43
sryap added a commit to sryap/FBGEMM that referenced this pull request Aug 15, 2024
@sryap sryap force-pushed the export-D61284671 branch from b8422e8 to 2caedf3 Compare August 15, 2024 05:53
@sryap sryap force-pushed the export-D61284671 branch from 2caedf3 to 4e318db Compare August 15, 2024 06:00
sryap added a commit to sryap/FBGEMM that referenced this pull request Aug 15, 2024
Summary:
X-link: facebookresearch/FBGEMM#82

Pull Request resolved: pytorch#2989

This diff adds a benchmark for measuring host-to-device copy
performance using `torch.ops.fbgemm.masked_index_put`. The host buffer
is a UVM buffer (by default it is `malloc+cudaHostRegister`).

Reviewed By: jianyuh

Differential Revision: D61284671
@facebook-github-bot

This pull request has been merged in a83b65c.
