Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rocm support for moe tuning script #251

Merged
merged 8 commits into from
Nov 13, 2024
Merged

Conversation

divakar-amd
Copy link

  • add rocm triton search space and pruning
  • Ray fix: use device id for multi-gpu tuning to allocate tensors and kernels. Otherwise, the all the Ray workers were getting mapped to the single gpu for tuning.

- add rocm triton search space and pruning
- Ray fix: use device id for multi-gpu tuning
@divakar-amd divakar-amd force-pushed the fix_upstrm_moe_tuning_script branch from 65aeb9f to c12c376 Compare October 30, 2024 17:52
@divakar-amd divakar-amd self-assigned this Oct 30, 2024
@divakar-amd divakar-amd requested a review from gshtras November 13, 2024 14:16
@divakar-amd divakar-amd changed the base branch from main to develop November 13, 2024 14:16
@divakar-amd divakar-amd merged commit 3afc735 into develop Nov 13, 2024
7 of 8 checks passed
@gshtras gshtras deleted the fix_upstrm_moe_tuning_script branch December 7, 2024 03:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants