ANN_BENCH: CAGRA-HNSW build in managed memory #1058
Draft
achirkin wants to merge 10 commits into branch-25.08 from
Conversation
Contributor (Author)
/ok to test
This PR replaces the standard configured_raft_resources handle with a customized handle for the CAGRA-HNSW benchmark. The new resource handle uses a single managed memory resource for everything: the RMM default memory resource, the RAFT workspace resource, and the RAFT large workspace resource. For the RAFT workspace resource, a pool is layered on top as usual to speed up frequent allocations.
The rationale behind this change is to allow using all available GPU memory through all stages of CAGRA build.
Before this change, the default setup uses a regular device memory pool for everything except large allocations; the large_memory_resource uses managed memory. The problem with this behavior is that the pool grows during the internal IVF-PQ build/search (the whole IVF-PQ index is stored in it) but doesn't shrink back during the graph optimization stage. As a result, the large allocations during the optimization stage severely oversubscribe UVM and grind performance to a halt.
With the new change, the RMM default memory resource is not part of the pool. Hence the pool stays relatively small (limited by the workspace resource adapter), and even the small pool that remains can be paged out by UVM when it's not actively in use.
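A minimal sketch of such a handle, assuming recent RMM/RAFT C++ APIs. This is illustrative, not the PR's actual diff: the class name, the initial pool size, and the exact RAFT setter may differ between versions (the code requires a CUDA device and the RMM/RAFT headers to build).

```cpp
#include <memory>

#include <rmm/mr/device/managed_memory_resource.hpp>
#include <rmm/mr/device/per_device_resource.hpp>
#include <rmm/mr/device/pool_memory_resource.hpp>

#include <raft/core/resource/device_memory_resource.hpp>
#include <raft/core/resources.hpp>

// Sketch: one managed memory resource backs the RMM default resource,
// the RAFT workspace resource, and the RAFT large workspace resource,
// so UVM can page allocations in and out across the CAGRA build stages.
class managed_raft_resources {
 public:
  managed_raft_resources()
    : workspace_pool_{std::make_shared<
        rmm::mr::pool_memory_resource<rmm::mr::managed_memory_resource>>(
        &managed_mr_, 1024ull * 1024 * 1024 /* illustrative 1 GiB initial size */)}
  {
    // RMM default resource: plain managed memory, no pool. Large
    // allocations (e.g. the internal IVF-PQ index) are pageable by UVM
    // and returned to the driver as soon as they are freed.
    rmm::mr::set_current_device_resource(&managed_mr_);

    // RAFT workspace: a pool on top of the same managed memory keeps
    // frequent small allocations fast; being managed memory, even the
    // pool can be paged out when it is not actively in use.
    raft::resource::set_workspace_resource(res_, workspace_pool_);
  }

  raft::resources const& handle() const { return res_; }

 private:
  rmm::mr::managed_memory_resource managed_mr_;
  std::shared_ptr<rmm::mr::pool_memory_resource<rmm::mr::managed_memory_resource>>
    workspace_pool_;
  raft::resources res_;
};
```

The key design choice is that the pool wraps only the workspace resource, not the default resource, so the pool's footprint stays bounded while large one-off allocations go straight to pageable managed memory.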