Expose ivf-flat centers to python/c by benfred · Pull Request #888 · rapidsai/cuvs

benfred · 2025-05-13T00:17:57Z

Similar to #881 - also expose centers for ivf-flat as well as ivf-pq

Similar to rapidsai#881 - also expose centers for ivf-flat as well as ivf-pq

tarang-jain · 2025-05-13T18:56:08Z

cpp/src/neighbors/ivf_flat_c.cpp

+  RAFT_EXPECTS(src.extent(0) == dst.extent(0), "Output centers has incorrect number of rows");
+  RAFT_EXPECTS(src.extent(1) == dst.extent(1), "Output centers has incorrect number of cols");
+
+  cudaMemcpyAsync(dst.data_handle(),


Can we use raft::copy or raft::copy_async in these calls instead of direct calls to cudaMemcpyAsync?

sure! I updated in the last commit to use the raft::copy from raft/util/cudart_utils.hpp .

(Fwiw, I originally tried to use the copy functions from raft/core/copy.hpp and raft/core/copy.cuh but couldn't - since we can't use the '.cuh' version inside the C-api, and the .hpp version was complaining about needing a cuda kernel for the D2H copy iirc, which is why I was using cudaMemCpyAsync directly here).

I am seeing that nn_descent_c.cpp also uses cudaMemcpyAsync. If there is a reason we have avoided the cudart header, we can stick to cudaMemcpyAsync. My understanding was that having the cudaMemcpyAsync call means that the file would have to be compiled with nvcc anyway so we should be reusing raft functions instead.
(cc @cjnolet)

My understanding was that having the cudaMemcpyAsync call means that the file would have to be compiled with nvcc anyway

@tarang-jain cudaMemCpyXX() is part of the CUDA runtime API, so it should not require nvcc to compile. Only the lower-level device function routines will need nvcc. Otherwise, it's just linking against the pre-compiled CUDA routines in the runtime API (kind of similar to what end-users do when they use cuVS C/C++ APIs).

The reason why raft::copy ends up requiring nvcc is because there were recently some device functions added to raft to work specifically with mdspan... to be honest, I'd be in favor of separating those out from the ones that only require the runtime API for this very purpose.

I updated nn_descent_c.cpp in the last commit to use raft::copy.

the cudaMemcpyAsync code is fine (as is copy functions from raft/util/cudart_utils.hpp) - I just couldn't use this code https://github.com/rapidsai/raft/blob/c2dc3124ce3fbcb5ff2ccabd88d7f57570b6aea9/cpp/include/raft/core/copy.cuh#L57-L61 from raft .

fwiw, one nice thing about using raft::copy is that it fixes one issue that this code used to have (wasn't checking the return value from cudaMemCpyAsync , which was a stupid oversight on my part =) ).

…cuvs into python_ivf_flat_centers

cpp/src/neighbors/ivf_pq/ivf_pq_build_common.cu

mythrocks

Some trivial nitpicks. Still coming to terms with the code.

The C++ side looks good to my eye. +1 non-binding.

cpp/include/cuvs/neighbors/ivf_pq.h

mythrocks · 2025-05-19T17:54:57Z

cpp/src/neighbors/ivf_flat_c.cpp

+  if (index->dtype.code == kDLFloat && index->dtype.bits == 32) {
+    auto index_ptr =
+      reinterpret_cast<cuvs::neighbors::ivf_flat::index<float, int64_t>*>(index->addr);
+    return index_ptr->n_lists();
+  } else if (index->dtype.code == kDLFloat && index->dtype.bits == 16) {
+    auto index_ptr =
+      reinterpret_cast<cuvs::neighbors::ivf_flat::index<half, int64_t>*>(index->addr);
+    return index_ptr->n_lists();
+  } else if (index->dtype.code == kDLInt && index->dtype.bits == 8) {
+    auto index_ptr =
+      reinterpret_cast<cuvs::neighbors::ivf_flat::index<int8_t, int64_t>*>(index->addr);
+    return index_ptr->n_lists();
+  } else if (index->dtype.code == kDLUInt && index->dtype.bits == 8) {
+    auto index_ptr =
+      reinterpret_cast<cuvs::neighbors::ivf_flat::index<uint8_t, int64_t>*>(index->addr);
+    return index_ptr->n_lists();


This is a recurring pattern in the code. One wonders if libcudf's type-dispatch pattern might be of value here.

We might consider exploring at a later date.

mythrocks · 2025-05-19T18:14:17Z

cpp/src/neighbors/ivf_pq/ivf_pq_build.cuh

+                                  sizeof(float) * index.dim_ext(),
+                                  sizeof(float) * index.dim(),
+                                  index.n_lists(),
+                                  cudaMemcpyDefault,


TIL cudaMemcpyDefault. I didn't know/realize that the direction could be inferred.

mythrocks · 2025-05-19T18:33:56Z

cpp/src/neighbors/ivf_flat_c.cpp

+  auto res_ptr   = reinterpret_cast<raft::resources*>(res);
+  auto index_ptr = reinterpret_cast<cuvs::neighbors::ivf_flat::index<T, IdxT>*>(index.addr);
+  auto dst       = cuvs::core::from_dlpack<output_mdspan_type>(centers);
+  auto src       = index_ptr->centers();


Would there be value in making any of these const?

I don't have familiarity with the code yet, so I haven't grokked the semantics of raft::copy. Apologies, if this is noise.

Co-authored-by: MithunR <mythrocks@gmail.com>

cjnolet · 2025-05-27T23:01:43Z

/merge

Similar to rapidsai#881 - also expose centers for ivf-flat as well as ivf-pq Authors: - Ben Frederickson (https://github.com/benfred) - Corey J. Nolet (https://github.com/cjnolet) Approvers: - Tarang Jain (https://github.com/tarang-jain) - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#888

Expose ivf-flat centers to python/c

a8b06e3

Similar to rapidsai#881 - also expose centers for ivf-flat as well as ivf-pq

benfred requested review from a team as code owners May 13, 2025 00:17

benfred added improvement Improves an existing functionality non-breaking Introduces a non-breaking change labels May 13, 2025

github-actions bot added cpp Python labels May 13, 2025

Use extract_centers to get ivf-pq centers

6ad385a

tarang-jain requested changes May 13, 2025

View reviewed changes

benfred added 4 commits May 13, 2025 12:19

use raft::copy

6f72740

Merge branch 'branch-25.06' into python_ivf_flat_centers

2cfbef8

use raft::copy for nn_descent_c as well

1914251

Merge branch 'python_ivf_flat_centers' of https://github.com/benfred/…

6a864c1

…cuvs into python_ivf_flat_centers

tarang-jain reviewed May 13, 2025

View reviewed changes

cpp/src/neighbors/ivf_pq/ivf_pq_build_common.cu Show resolved Hide resolved

reduce extract_centers boilerplate

7080e26

benfred self-assigned this May 13, 2025

benfred added 2 commits May 13, 2025 15:33

move extract_centers to header

ed474d1

Merge branch 'branch-25.06' into python_ivf_flat_centers

ed1670e

tarang-jain approved these changes May 14, 2025

View reviewed changes

Merge branch 'branch-25.06' into python_ivf_flat_centers

71355fe

mythrocks reviewed May 19, 2025

View reviewed changes

cjnolet and others added 3 commits May 27, 2025 15:08

Merge branch 'branch-25.06' into python_ivf_flat_centers

936b45e

Merge branch 'branch-25.06' into python_ivf_flat_centers

1e26626

Update cpp/include/cuvs/neighbors/ivf_pq.h

cce8a75

Co-authored-by: MithunR <mythrocks@gmail.com>

cjnolet approved these changes May 27, 2025

View reviewed changes

rapids-bot bot merged commit b0f45bb into rapidsai:branch-25.06 May 27, 2025
75 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose ivf-flat centers to python/c#888

Expose ivf-flat centers to python/c#888
rapids-bot[bot] merged 13 commits intorapidsai:branch-25.06from
benfred:python_ivf_flat_centers

benfred commented May 13, 2025

Uh oh!

tarang-jain May 13, 2025

Uh oh!

benfred May 13, 2025

Uh oh!

tarang-jain May 13, 2025

Uh oh!

cjnolet May 13, 2025 •

edited

Loading

Uh oh!

benfred May 13, 2025

Uh oh!

Uh oh!

mythrocks left a comment

Uh oh!

Uh oh!

mythrocks May 19, 2025

Uh oh!

mythrocks May 19, 2025

Uh oh!

mythrocks May 19, 2025

Uh oh!

cjnolet commented May 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

benfred commented May 13, 2025

Uh oh!

tarang-jain May 13, 2025

Choose a reason for hiding this comment

Uh oh!

benfred May 13, 2025

Choose a reason for hiding this comment

Uh oh!

tarang-jain May 13, 2025

Choose a reason for hiding this comment

Uh oh!

cjnolet May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benfred May 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mythrocks left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mythrocks May 19, 2025

Choose a reason for hiding this comment

Uh oh!

mythrocks May 19, 2025

Choose a reason for hiding this comment

Uh oh!

mythrocks May 19, 2025

Choose a reason for hiding this comment

Uh oh!

cjnolet commented May 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cjnolet May 13, 2025 •

edited

Loading