Minor CK FP8 Tuning Improvements #2987

Closed
wants to merge 1 commit into from

@jwfromm jwfromm commented Aug 14, 2024

Summary:
This diff makes a few small changes to improve CK FP8 performance, building on recent ROCm and CK improvements that have landed.

Specifically, we use the large kernel added in D60996231 more liberally, since it performs well, and re-enable some files in the CK Profiler that can now compile.

The latest llama benchmarks after this change are available [here](https://docs.google.com/spreadsheets/d/1GD44u4Sud_6T9iq_SJvYmSn8tx0bRd9niZPy2gVHK-s/edit?gid=482861329#gid=482861329).

Differential Revision: D61285882

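For readers unfamiliar with shape-based kernel dispatch, the following is a minimal sketch of what "using the large kernel more liberally" could look like: lowering the size thresholds at which a large-tile kernel is chosen. The kernel names, the `GemmShape` type, and every threshold value are hypothetical and chosen only for illustration; none of them are taken from this diff, FBGEMM, or CK.

```python
from dataclasses import dataclass

# Hypothetical kernel identifiers, for illustration only.
LARGE_KERNEL = "fp8_large_tile"
SMALL_KERNEL = "fp8_small_tile"


@dataclass(frozen=True)
class GemmShape:
    """Problem size of a GEMM: (m x k) times (k x n)."""
    m: int
    n: int
    k: int


def select_kernel(shape: GemmShape, min_m: int = 128, min_n: int = 1024) -> str:
    """Return the large kernel when both M and N reach their thresholds.

    The default thresholds are made-up values, not tuned numbers.
    """
    if shape.m >= min_m and shape.n >= min_n:
        return LARGE_KERNEL
    return SMALL_KERNEL


def select_kernel_liberal(shape: GemmShape) -> str:
    """Same heuristic with lower (again made-up) thresholds, so more
    shapes are routed to the large kernel."""
    return select_kernel(shape, min_m=64, min_n=512)


if __name__ == "__main__":
    shape = GemmShape(m=64, n=2048, k=4096)
    print(select_kernel(shape))          # stricter thresholds
    print(select_kernel_liberal(shape))  # relaxed thresholds
```

Whether relaxed thresholds actually help depends on the hardware and on the kernels involved, which is why a change like this is normally validated against benchmarks such as the ones linked above.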
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D61285882

netlify bot commented Aug 14, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

| Name | Link |
| --- | --- |
| 🔨 Latest commit | b56e430 |
| 🔍 Latest deploy log | https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66bcf9d1113d01000939c8f4 |
| 😎 Deploy Preview | https://deploy-preview-2987--pytorch-fbgemm-docs.netlify.app |

@facebook-github-bot

This pull request has been merged in 537aeb3.

2 participants