understory_box_tree: speed up hit testing #99

waywardmonkeys · 2026-01-13T05:00:00Z

Hit testing is a hot path and used to pay avoidable per-query overhead.

Avoid allocation: use IndexGeneric::visit_point instead of query_point so hit testing does not allocate a Vec of candidates each call.
Avoid repeated work: reuse cached world_transform_inverse and depth from Tree::commit while scoring candidates.
Preserve behavior: still filters precisely by local_bounds, local_clip, and ancestor clips (after the coarse AABB query), and keeps the existing z/depth/newer tie-break.

This improves hit testing across all index backends because the allocation/inverse/depth work was backend-agnostic overhead.

Benchmark (synthetic ui_box_tree): hit_test_point/flatvec improved about 22% on my machine (~61us -> ~47us per iteration). Command:
cargo bench -p understory_benches --bench ui_box_tree -- ui_box_tree/hit_test_point/flatvec --noplot

Hit testing is a hot path and used to pay avoidable per-query overhead. - Avoid allocation: use `IndexGeneric::visit_point` instead of `query_point` so hit testing does not allocate a `Vec` of candidates each call. - Avoid repeated work: reuse cached `world_transform_inverse` and `depth` from `Tree::commit` while scoring candidates. - Preserve behavior: still filters precisely by `local_bounds`, `local_clip`, and ancestor clips (after the coarse AABB query), and keeps the existing z/depth/newer tie-break. This improves hit testing across all index backends because the allocation/inverse/depth work was backend-agnostic overhead. Benchmark (synthetic `ui_box_tree`): `hit_test_point/flatvec` improved about 22% on my machine (~61us -> ~47us per iteration). Command: `cargo bench -p understory_benches --bench ui_box_tree -- ui_box_tree/hit_test_point/flatvec --noplot`

tomcur

Nice speed up! The timings you mentioned reproduce on my machine as well.

I've left some nits inline.

For some workloads we can go further; especially if visit_point yields many nodes from the same subtree for a given point, the repeated tree walking could comparatively get very expensive (as we're traversing the same ancestors over and over towards the root). Fixing that requires either allocating or some mutable scratch context.

tomcur · 2026-01-13T09:53:08Z

understory_box_tree/src/tree.rs


-            if let Some(local_clip) = node.local.local_clip
-                && !local_clip.contains(local_point)
+            // Transform once: most candidates fail at the bounds check, so avoid repeated work.


This comment doesn't look entirely accurate to me: in the current box tree model, the local bounds are a node's real geometry, and the contains check is the first necessary condition for a hit by finely testing whether the point in local coordinates is within those bounds. The spatial index does that coarsely and it's probably accurate to say the comment as written applies more to the spatial index.

Perhaps instead:

Suggested change

// Transform once: most candidates fail at the bounds check, so avoid repeated work.

// Finely test whether `point` is within the node's bounds and the node's own clip.

tomcur · 2026-01-13T10:05:27Z

understory_box_tree/src/tree.rs

-                    if z > *z_best
-                        || (z == *z_best
+                None => best = Some((id, z, depth)),
+                Some((best_id, z_best, depth_best)) => {


Suggested change

Some((best_id, z_best, depth_best)) => {

Some((id_best, z_best, depth_best)) => {

or

Suggested change

Some((best_id, z_best, depth_best)) => {

Some((best_id, best_z, best_depth)) => {

tomcur · 2026-01-13T10:08:36Z

understory_box_tree/src/tree.rs

+            // Walk ancestors checking their clips for precise hit filtering.
+            //
+            // This is intentionally only done for candidates that pass the local bounds/clip
+            // checks, since ancestor traversal is comparatively expensive.


This is better than the original comment in this place, except that we may want to keep mentioning explicitly this walks towards the root (just so it's immediately clear which direction this is walking in).

Suggested change

// Walk ancestors checking their clips for precise hit filtering.

//

// This is intentionally only done for candidates that pass the local bounds/clip

// checks, since ancestor traversal is comparatively expensive.

// Walk ancestors towards the node's root checking their clips for precise hit filtering.

//

// This is intentionally only done for candidates that pass the local bounds/clip

// checks, since ancestor traversal is comparatively expensive.

waywardmonkeys requested review from jrmoulton and tomcur January 13, 2026 05:00

waywardmonkeys mentioned this pull request Jan 13, 2026

understory_box_tree: make no-op commit fast #100

Open

tomcur approved these changes Jan 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

understory_box_tree: speed up hit testing #99

understory_box_tree: speed up hit testing #99

Uh oh!

waywardmonkeys commented Jan 13, 2026

Uh oh!

tomcur left a comment

Uh oh!

tomcur Jan 13, 2026

Uh oh!

tomcur Jan 13, 2026

Uh oh!

tomcur Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	// Transform once: most candidates fail at the bounds check, so avoid repeated work.
	// Finely test whether `point` is within the node's bounds and the node's own clip.

	Some((best_id, z_best, depth_best)) => {
	Some((id_best, z_best, depth_best)) => {

	Some((best_id, z_best, depth_best)) => {
	Some((best_id, best_z, best_depth)) => {

understory_box_tree: speed up hit testing #99

Are you sure you want to change the base?

understory_box_tree: speed up hit testing #99

Uh oh!

Conversation

waywardmonkeys commented Jan 13, 2026

Uh oh!

tomcur left a comment

Choose a reason for hiding this comment

Uh oh!

tomcur Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

tomcur Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

tomcur Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants