Fix fp16 overflow by hhy3 · Pull Request #1084 · rapidsai/cuvs

hhy3 · 2025-07-04T05:51:08Z

This PR fixes issue #914 that accumulation using fp16 causes overflow

Signed-off-by: zh Wang <rekind133@outlook.com>

copy-pr-bot · 2025-07-04T05:51:11Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

achirkin

Thank you for the contribution. I see the PR changes the accumulation type from fp16 to fp32 to avoid fp overflow. However I'm not sure if this is desirable in general and whether the speed drop is acceptable for the cases when the overflow doesn't happen.
Maybe we'd better just advise the user to switch to fp32 variant of the algorithm?
Please support the PR with the benchmark results (using cuvs ann-bench) before and after the PR for fp16 and fp32 if you decide to proceed with this approach.

achirkin · 2025-07-04T06:35:07Z

cpp/src/neighbors/detail/ann_utils.cuh

 template <>
 struct config<half> {
-  using value_t                    = half;
+  using value_t                    = float;


Please check whether this is used outside the IVF-Flat. Changing the accumulation type like this can have a drastic impact on performance.

I did simple benchmark and it showed no significant differences. I'll use cuvs ann-bench to get a more detailed benchmark results later

@hhy3 any updates here? We're about to begin burndown for 25.08 release. Should we consider this for 25.08 or push to 25.10 (October)?

@cjnolet hi, push it to 25.10, thx

Fix fp16 overflow

babf7bd

Signed-off-by: zh Wang <rekind133@outlook.com>

hhy3 requested a review from a team as a code owner July 4, 2025 05:51

github-actions bot added the cpp label Jul 4, 2025

achirkin requested changes Jul 4, 2025

View reviewed changes

cjnolet assigned hhy3 Jul 11, 2025

cjnolet added bug Something isn't working non-breaking Introduces a non-breaking change labels Jul 11, 2025

cjnolet added this to Vector Search, ML, & Data Mining Release Board Jul 11, 2025

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board Jul 11, 2025

cjnolet moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board Jul 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix fp16 overflow#1084

Fix fp16 overflow#1084
hhy3 wants to merge 1 commit intorapidsai:branch-25.08from
hhy3:fix_fp16_overflow

hhy3 commented Jul 4, 2025

Uh oh!

copy-pr-bot bot commented Jul 4, 2025

Uh oh!

achirkin left a comment

Uh oh!

achirkin Jul 4, 2025

Uh oh!

hhy3 Jul 4, 2025

Uh oh!

cjnolet Jul 22, 2025

Uh oh!

hhy3 Jul 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hhy3 commented Jul 4, 2025

Uh oh!

copy-pr-bot bot commented Jul 4, 2025

Uh oh!

achirkin left a comment

Choose a reason for hiding this comment

Uh oh!

achirkin Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

hhy3 Jul 4, 2025

Choose a reason for hiding this comment

Uh oh!

cjnolet Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

hhy3 Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants