Finish 2 pass candidate search #1778

gunney1 · 2026-01-21T04:11:35Z

Summary

This PR is a refactoring
It does the following:
- Change the two computeClipVolumes3D methods to call the workhorse computeClipVolumes3DTet, which implements the two-pass candidate search (see PR Use 2-pass candidate search for mesh clipping #1756). This is facilitated by two temporary index lists that computeClipVolumes3DTet require.
- Add a few comments and error checks that didn't make it into previous PRs in this series.

This PR, along with #1756, resolves issue #1704.

Performance data

Test problem

I collected clipping times using a 100^3 cubic computational mesh. Each test geometry takes between 6.1% and 9.7% of the domain. I added a "cupmesh", which is a tetrahedral mesh representation of a cup, from the data/quest/cup.proe file. This image is for illustration only. The actual shapes are bigger and centered on the mesh.

Timers

The benchmarks were run on 1 rank, using OpenMP on rzwhippet and HIP on rzadams. Here are the times spent clipping each geometry, the median of 5 repetitions. The "sl" column headings below indicate the level of screening used to avoid unneeded call to the clipping function.

sl0: no screening
sl1: screen out cells using geometry-specific algorithms
sl2: decompose each remaining cell from sl1 into 6 non-degenerate tets and screen the tets.
sl3: sl2 plus try to determine in/out relationship between 2 primitives before computing their overlap volumes.

On rzwhippet:

                                      sl0    sl1    sl2    sl3
    clipping:cone.0                 2.012  1.386  1.249  1.258
    clipping:cupmesh.0             18.410  2.335  0.927  0.806
    clipping:cyl.0                  2.177  0.518  0.453  0.452
    clipping:hex.0                 11.436  0.917  0.291  0.238
    clipping:sor.0                  5.088  1.112  0.982  0.969
    clipping:tet.0                  9.182  0.357  0.115  0.114
    clipping:tetmesh.0             16.058  1.148  0.413  0.357

On rzadams:

                                     sl0    sl1    sl2    sl3
    clipping:cone.0                0.658  0.278  0.258  0.259
    clipping:cupmesh.0             0.811  0.284  0.281  0.306
    clipping:cyl.0                 0.662  0.193  0.207  0.208
    clipping:hex.0                 0.669  0.176  0.201  0.231
    clipping:sor.0                 0.971  0.221  0.236  0.235
    clipping:tet.0                 0.646  0.170  0.164  0.163
    clipping:tetmesh.0             0.696  0.245  0.246  0.246

Remarks about the performance times

The largest gain comes from sl1, the simplest screening. The higher screening levels build on this one.
The other screening levels had small but non-trivial gains on rzwhippet. On rzadams, the gain was negligible and even lost a little in one case. I think it's likely that with a denser computational mesh, this will different. As always, keep in mind that performance is highly configuration-dependent.
The gain from sl1 to sl2 was better before we switched from a 24-tet decomposition to a 18-tet decomposition, which was really a 6-tet decomposition on a cartensian mesh. This makes sense because the larger tets are more likely to intersect the geometry's boundary.
The detailed timers show that the time spent screening is little to negligible.
All screen levels benefit from the 2-pass candidate search. This is not a comparison of the 2-pass vs the single-pass. The original implementation in IntersectionShaper used single-pass and was significantly slower.

computeClipVolumes3DTets, using temporary mock-up of the tets. This way, the computeClipVolumes3D methods gets the 2-pass candidate search, while removing a lot of code. The cose of the mock-up array is negligible.

…2-pass-candidate-search

gunney1 added 5 commits January 16, 2026 16:15

Don't copy mesh to host unless writing to disk.

831cb43

Remove unused code.

484782a

The less-screened computeClipVolumes3D functions call the

2d3133a

computeClipVolumes3DTets, using temporary mock-up of the tets. This way, the computeClipVolumes3D methods gets the 2-pass candidate search, while removing a lot of code. The cose of the mock-up array is negligible.

Silence warnings in opt build.

fef0943

Merge remote-tracking branch 'gh/develop' into feature/gunney/finish-…

9755f06

…2-pass-candidate-search

gunney1 requested review from Arlie-Capps, BradWhitlock, bmhan12, kennyweiss, publixsubfan and rhornung67 January 21, 2026 04:11

gunney1 self-assigned this Jan 21, 2026

gunney1 added Quest Issues related to Axom's 'quest' component Performance Issues related to code performance labels Jan 21, 2026

gunney1 added 2 commits January 20, 2026 20:46

Comment changes that didn't make it into the earlier mesh clipper PRs.

bafb87b

Error checking and annotation name uniformity.

6dd9748

gunney1 force-pushed the feature/gunney/finish-2-pass-candidate-search branch from 8536584 to 6dd9748 Compare January 21, 2026 05:16

gunney1 marked this pull request as ready for review January 21, 2026 06:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Finish 2 pass candidate search #1778

Finish 2 pass candidate search #1778

Uh oh!

gunney1 commented Jan 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Finish 2 pass candidate search #1778

Are you sure you want to change the base?

Finish 2 pass candidate search #1778

Uh oh!

Conversation

gunney1 commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Performance data

Test problem

Timers

Remarks about the performance times

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gunney1 commented Jan 21, 2026 •

edited

Loading