ScaNN: Overlapped gather for AVQ#1286
Conversation
6d3aa3c to
e223eb7
Compare
e223eb7 to
c225698
Compare
tfeher
left a comment
There was a problem hiding this comment.
Thanks @rmaschal for the PR, looks good overall, I just have one suggestion for a cosmetic change.
In general, had to think about why do we need a specific version of gather here. Although we have h2d gather in RAFT (admittedly not properly exposed), here an explicit prefetch method enables the user to schedule additional GPU operations in parallel. That looks good to me.
tfeher
left a comment
There was a problem hiding this comment.
Thanks for the update, LGTM!
|
PR looks good to me, thanks for the clarification on my questions. |
bkarsin
left a comment
There was a problem hiding this comment.
PR looks good to me, thanks (and very sorry for the mistaken close/open)
|
/merge |
03d62f6
into
rapidsai:branch-25.10
|
Hi @bkarsin, |
Adds a class cluster_loader for AVQ that enables overlapping the gather operation and HtoD copy with GPU computation.
There are two scenarios: