SNL sparta #1033

rfhaque · 2025-09-05T20:38:41Z

This PR adds the Sparta application (docs: Adding a Benchmark)

Not using develop, source code: Sparta build for AMD GPUs sparta/sparta#573
Adding repo/sparta-snl/package.py to specify build
Adding repo/sparta-snl/application.py to specify runtime parameters
Adding experiments/sparta-snl/experiment.py to specify the experiment to run
Sparta dry run is passing
Add Sparta+openmp and Sparta+rocm to gitlab CI on Dane and Tuo or Tioga -- see Make Alternate Workload Nightly Tests for Lammps and AMG #1111
Update the package to meet Spack 1.0 requirements @scheibelp

…periment_sparta

pearce8

Sparta dry run should be passing
Add Sparta+openmp and Sparta+rocm to CI on Dane and Tuo or Tioga
Since this will likely go in after the Spack 1.0 support, please update the package to meet Spack 1.0 requirements.

scheibelp

(this looks compatible with #953)

.gitlab/tests/shared_flux_clusters.yml

.gitlab/tests/shared_slurm_clusters.yml

pearce8 · 2025-09-24T19:15:31Z

.gitlab/tests/shared_flux_clusters.yml

+        BENCHMARK: [amg2023, kripke, laghos, raja-perf]
        VARIANT: [+rocm]
        GPUMODE: SPX
+      # sparta needs specific variant


@michaelmckinsey1 What do you mean by "Sparta needs a different version of rocm"?

That's not what this is saying. This benchmark can not be defined the same as the other rocm benchmarks because it requires a specific variant when running with rocm, i.e. fft_kokkos=hipfft. That is why I was thinking this is bad practice to encode this variant in the experiment. I was thinking this is something you would set in the package.py, like depends_on("hipfft", when="rocm"), but after discussing with @scheibelp and @rfhaque yesterday I'm not sure it's so simple.

@scheibelp we need help with this

pearce8 · 2025-09-26T21:27:43Z

experiments/sparta-snl/experiment.py

+        if self.spec.satisfies("+openmp"):
+            kokkos_mode += "t {n_threads_per_proc}"
+        if self.spec.satisfies("+rocm") or self.spec.satisfies("+cuda"):
+            kokkos_mode += " g {n_gpus}"


g should probably be 1 -- or n_gpus/m_gpus_per_node

michaelmckinsey1 · 2025-10-06T19:03:15Z

.github/utils/dryruns.py

+    "sparta-snl+openmp aws-pcluster instance_type=c6g.xlarge",
+    "sparta-snl+openmp aws-pcluster instance_type=c4.xlarge",
+    "sparta-snl+openmp generic-x86",


FYI This is to fix dryrun error where this particular experiment is generated asking for too many cores for these particular systems.

.gitlab/tests/shared_flux_clusters.yml

pearce8 · 2025-10-20T17:19:04Z

@michaelmckinsey1 do you know why lint is failing?

michaelmckinsey1 · 2025-10-20T17:34:04Z

It says [+] Would fix .gitlab/tests/shared_flux_clusters.yml

michaelmckinsey1 · 2025-10-21T20:53:57Z

@pearce8 That lint failure requires yamlfix .gitlab/tests/shared_flux_clusters.yml. I have fixed it. I think #1116 will address when to run certain linter commands in the future

codecov-commenter · 2025-10-21T21:40:32Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 65.44%. Comparing base (d5e099c) to head (a36561a).

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1033      +/-   ##
===========================================
+ Coverage    65.31%   65.44%   +0.12%     
===========================================
  Files           44       44              
  Lines         3241     3241              
  Branches       256      256              
===========================================
+ Hits          2117     2121       +4     
+ Misses        1117     1113       -4     
  Partials         7        7

see 1 file with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

sparta rocm implementation

080361b

github-actions bot added experiment New or modified experiment application labels Sep 5, 2025

rfhaque requested a review from pearce8 September 5, 2025 20:38

rfhaque mentioned this pull request Sep 6, 2025

Sparta build for AMD GPUs sparta/sparta#573

Merged

4 tasks

Riyaz Haque and others added 2 commits September 5, 2025 22:06

Fix sed

6394b34

Merge branch 'develop' into experiment_sparta

fb17aa3

pearce8 marked this pull request as ready for review September 6, 2025 15:07

Riyaz Haque and others added 7 commits September 6, 2025 11:43

CUDA build

63f9d3f

Merge branch 'experiment_sparta' of github.com:LLNL/benchpark into ex…

f90fd73

…periment_sparta

Merge branch 'develop' into experiment_sparta

a38e6fb

Use the main source repository

c097b4f

lint

aece333

Remove scaling import

34bb898

Merge branch 'develop' into experiment_sparta

3142a52

pearce8 added the changes requested Changes requested label Sep 10, 2025

pearce8 requested changes Sep 10, 2025

View reviewed changes

pearce8 assigned rfhaque and scheibelp Sep 10, 2025

pearce8 mentioned this pull request Sep 10, 2025

Spack packages needing updates/testing after Spack 1.0 support #1048

Open

6 tasks

scheibelp reviewed Sep 10, 2025

View reviewed changes

Riyaz Haque and others added 5 commits September 18, 2025 14:28

Merge remote-tracking branch 'origin/develop' into experiment_sparta

602e327

Merge remote-tracking branch 'origin/develop' into experiment_sparta

179d606

spack changes

278df38

Merge remote-tracking branch 'origin/develop' into experiment_sparta

6723834

Update shared_flux_clusters.yml

37df216

michaelmckinsey1 reviewed Sep 23, 2025

View reviewed changes

.gitlab/tests/shared_flux_clusters.yml Outdated Show resolved Hide resolved

Update shared_slurm_clusters.yml

54a5872

michaelmckinsey1 reviewed Sep 23, 2025

View reviewed changes

.gitlab/tests/shared_slurm_clusters.yml Show resolved Hide resolved

sparta needs special variant on rocm

cfd4675

pearce8 reviewed Sep 24, 2025

View reviewed changes

pearce8 reviewed Sep 26, 2025

View reviewed changes

Merge remote-tracking branch 'origin/develop' into experiment_sparta

e30132f

rfhaque mentioned this pull request Oct 6, 2025

Sparta status #1019

Open

21 tasks

Riyaz Haque and others added 2 commits October 6, 2025 11:44

Merge remote-tracking branch 'origin/develop' into experiment_sparta

d94b5c4

Update dryruns.py

e5e2a32

michaelmckinsey1 reviewed Oct 6, 2025

View reviewed changes

Update shared_flux_clusters.yml

b556163

github-actions bot added the ci CI, unit tests, GitHub actions label Oct 6, 2025

michaelmckinsey1 reviewed Oct 6, 2025

View reviewed changes

.gitlab/tests/shared_flux_clusters.yml Outdated Show resolved Hide resolved

Riyaz Haque added 3 commits October 6, 2025 20:31

Merge remote-tracking branch 'origin/develop' into experiment_sparta

3e32a98

Add variant for GPU Aware MPI

d4249d5

Add dimension variables

b711f1d

pearce8 previously approved these changes Oct 10, 2025

View reviewed changes

Merge with develop

ebb00b5

rfhaque dismissed pearce8’s stale review via ebb00b5 October 17, 2025 21:06

Change openmp num threads to 1

d90fbe0

rfhaque added ready for review Ready for review and removed changes requested Changes requested labels Oct 17, 2025

Merge remote-tracking branch 'origin/develop' into experiment_sparta

6cd4f06

pearce8 and others added 3 commits October 21, 2025 12:33

lint

7d39d63

Merge branch 'develop' into experiment_sparta

8e067a1

lint

a36561a

pearce8 approved these changes Oct 22, 2025

View reviewed changes

pearce8 merged commit 58e4c2b into develop Oct 22, 2025
58 checks passed

pearce8 deleted the experiment_sparta branch October 22, 2025 18:29

SNL sparta #1033

SNL sparta #1033

Uh oh!

Conversation

rfhaque commented Sep 5, 2025 • edited by pearce8 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

This PR adds the Sparta application (docs: Adding a Benchmark)

Uh oh!

pearce8 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

scheibelp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

pearce8 Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

michaelmckinsey1 Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

pearce8 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

pearce8 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

michaelmckinsey1 Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pearce8 commented Oct 20, 2025

Uh oh!

michaelmckinsey1 commented Oct 20, 2025

Uh oh!

michaelmckinsey1 commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Oct 21, 2025

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

rfhaque commented Sep 5, 2025 •

edited by pearce8

Loading

pearce8 left a comment •

edited

Loading

michaelmckinsey1 commented Oct 21, 2025 •

edited

Loading