BitwiseHamming distance for NN Descent#1101

Merged
rapids-bot[bot] merged 19 commits into rapidsai:branch-25.08 from jinsolp:nnd-hamming
Jul 18, 2025

Conversation

Contributor

@jinsolp jinsolp commented Jul 10, 2025

Adds bitwise Hamming distance for NN Descent.

@jinsolp jinsolp self-assigned this Jul 10, 2025
@jinsolp jinsolp requested a review from a team as a code owner July 10, 2025 19:11
@jinsolp jinsolp added feature request New feature or request non-breaking Introduces a non-breaking change labels Jul 10, 2025
int num_load_elems = (step == raft::ceildiv(data_dim, TILE_COL_WIDTH) - 1)
? data_dim - step * TILE_COL_WIDTH
: TILE_COL_WIDTH;
if (metric != cuvs::distance::DistanceType::BitwiseHamming) {
Contributor Author

@jinsolp jinsolp Jul 10, 2025


The only diff here is wrapping the wmma operation in if (metric != cuvs::distance::DistanceType::BitwiseHamming) so that we don't perform unnecessary computations for BitwiseHamming. (The diff detects all the added indentation as changes, which makes it confusing.)

Member

@divyegala divyegala left a comment


We should be able to allocate just half the fp16 matrix for bitwise hamming, right?

Comment on lines +1015 to +1019
res,
nrow_,
build_config.metric == cuvs::distance::DistanceType::BitwiseHamming
? (build_config.dataset_dim + 1) / 2
: build_config.dataset_dim)},
Contributor Author


Yep that's what we do here!

const uint8_t* data_n1 = reinterpret_cast<const uint8_t*>(data) + n1 * data_dim;
const uint8_t* data_n2 = reinterpret_cast<const uint8_t*>(data) + n2 * data_dim;
for (int d = 0; d < data_dim; d++) {
s_distances[i] += __popc(static_cast<uint32_t>(data_n1[d] ^ data_n2[d]) & 0xff);
Contributor


wherever you are doing this, have you ensured that the dim that you pass along is correctly scaled? If you were to convert a half to two uint8s, the dim would have to be doubled.
Furthermore, I haven't looked into the NN Descent logic entirely, but in case you are operating in the half space, I don't think you'd have to reinterpret_cast everywhere to uint8. It's more efficient to stay in the half space and do something like:
__popc(static_cast<uint32_t>(data_n1[d] ^ data_n2[d]) & 0xffffu)

Contributor Author

@jinsolp jinsolp Jul 16, 2025


wherever you are doing this, have you ensured that the dim that you pass along is correctly scaled? If you were to convert a half to two uint8s, the dim would have to be doubled.

I am allocating dim/2 dimensions for the half-type array. To keep things straightforward, dim always refers to the dimension of the given dataset (fp32, int8, etc.), and the dimension used to allocate the fp16 device array is derived from the data type (dim/2 for int8 and uint8, unchanged for other types).

I don't think you'd have to reinterpret_cast everywhere to uint8.

The issue with keeping the pointer as fp16 and looping over data_dim/2 dimensions is that the kernel would have to check whether the last byte holds a valid value (because the original int8 data could have an odd number of dimensions). I thought it would be easier to cast to int8 and loop over the original data_dim instead : )

Contributor

@tarang-jain tarang-jain Jul 16, 2025


I see. Yes, there can be an odd number of dims, for which we can fall back to int8; but when the number of dims is even, we can do it in the half space. I'd argue that we can do even more: if the dim is divisible by 4, reinterpret_cast to uint32_t so you'd only have to popcount over one fourth of the dim (I'm doing the same thing with BitwiseHamming in ivf-flat). However, considering the deadlines, we can look into these optimizations later. Can we create a GitHub issue for this and write it as a TODO here?
Regarding the dims, I just wanted to verify that the dims being used everywhere are scaled correctly, but it looks like you have already checked those things, so apart from creating that GitHub issue this PR looks okay to me.

Contributor Author


Makes sense, added an issue here #1127

@divyegala
Member

/merge

@rapids-bot rapids-bot Bot merged commit 722b7e6 into rapidsai:branch-25.08 Jul 18, 2025
231 of 249 checks passed
@github-project-automation github-project-automation Bot moved this from In Progress to Done in Unstructured Data Processing Jul 18, 2025
@jinsolp jinsolp deleted the nnd-hamming branch July 18, 2025 23:07

Labels

cpp, feature request (New feature or request), non-breaking (Introduces a non-breaking change)

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants