Ray population callback by Waqar-ukaea · Pull Request #13 · Waqar-ukaea/xdg

Waqar-ukaea · 2026-01-19T16:33:08Z

Exploring the idea of exposing XDG's ray buffers so that downstream code can populate them directly, rather than going through the array-based versions of the existing API.

I think the array-based versions are useful for small test cases, but they inherently require host-to-device transfers, which severely limits their usability. Production code on a GPU is unlikely to want these code paths, since the device transfers make them very expensive.

I was already moving towards an API that exposes the ray buffers directly, as that is what I had implemented inside the ray-benchmark miniapp. But I think I have landed on something a lot nicer for doing so here.

I'm trying to make use of a callback method within XDG that exposes the ray buffer directly, so that downstream code can provide a callback to populate the buffer. Right now I've only tested this with my miniapp writing its own GPRT callback, but it does work.

So the downstream app runs something like:

1. Call `xdg::populate_external_rays(callback, num_rays)`
   - XDG allocates device memory for rays (if not already large enough)
   - XDG passes device pointers to the callback
   - User's callback populates the buffers using their preferred compute kernel/shader
   - User's callback returns (XDG assumes buffers are now populated)
2. Call `XDG::ray_fire_prepared()` to trace the populated rays
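
To make the control flow above concrete, here is a minimal host-only sketch of the pattern. It is not XDG's actual API: `MockRayTracer`, `RayBufferView`, `RayPopulationCallback`, `fill_rays_along_x` and every signature are hypothetical, plain `std::vector` storage stands in for device memory, and the "trace" is just a ray-plane distance so the example runs without a GPU.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical stand-ins: none of these names come from XDG's headers.
struct Vec3 { double x, y, z; };

// View handed to the callback. In a GPU build these would be device
// pointers; here they point into host vectors so the sketch runs anywhere.
struct RayBufferView {
  Vec3* origins;
  Vec3* directions;
  std::size_t capacity;
};

using RayPopulationCallback =
    std::function<void(const RayBufferView&, std::size_t)>;

class MockRayTracer {
public:
  // Grow the internal buffers only if they are too small, then let the
  // caller fill them in place. Once the callback returns, the buffers are
  // assumed to be populated.
  void populate_external_rays(const RayPopulationCallback& callback,
                              std::size_t num_rays) {
    if (origins_.size() < num_rays) {
      origins_.resize(num_rays);
      directions_.resize(num_rays);
      ++allocation_count_;
    }
    callback(RayBufferView{origins_.data(), directions_.data(),
                           origins_.size()},
             num_rays);
    num_prepared_ = num_rays;
  }

  // Stand-in for the real trace: distance from each prepared ray to the
  // plane x = plane_x, or -1 if the ray does not travel in +x.
  std::vector<double> ray_fire_prepared(double plane_x) const {
    std::vector<double> distances(num_prepared_, -1.0);
    for (std::size_t i = 0; i < num_prepared_; ++i) {
      if (directions_[i].x > 0.0) {
        distances[i] = (plane_x - origins_[i].x) / directions_[i].x;
      }
    }
    return distances;
  }

  std::size_t allocation_count() const { return allocation_count_; }

private:
  std::vector<Vec3> origins_;
  std::vector<Vec3> directions_;
  std::size_t num_prepared_ = 0;
  std::size_t allocation_count_ = 0;
};

// Example user callback: ray i starts at (i, 0, 0) and points along +x.
// A real downstream code would launch its own kernel/shader here instead.
void fill_rays_along_x(const RayBufferView& view, std::size_t num_rays) {
  assert(num_rays <= view.capacity);
  for (std::size_t i = 0; i < num_rays; ++i) {
    view.origins[i] = Vec3{static_cast<double>(i), 0.0, 0.0};
    view.directions[i] = Vec3{1.0, 0.0, 0.0};
  }
}
```

A caller would construct a `MockRayTracer`, pass `fill_rays_along_x` (or a lambda) to `populate_external_rays`, and then call `ray_fire_prepared`. Because the buffers are only resized when too small, a second call with fewer rays reuses the existing allocation.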

I still need to write an equivalent point_in_volume_prepared()

This avoids the host-device transfers that the array versions of ray_fire() and point_in_volume() suffer from, because users write directly to XDG's device buffers. I don't actually know whether the callback approach would play nicely with OpenMP though, we'll have to see...


Waqar-ukaea added 9 commits January 19, 2026 15:49

- Attempting to implement a callback based method for populating internal XDG buffers (553e0b9)
- Removed the now redundant pack_external_rays() code path (f0f7389)
- Made DeviceRayHitBuffers more opaque to abstract away from GPRT specific logic (ce80cdd)
- Updated some comments (201d069)
- Renamed ray_fire_packed() to ray_fire_prepared() (7c25fda)
- Added the required CMake linking to GPRT for ray_benchmark miniapp (2d78b02)
- Added the ability to trace against multiple volumes within same raygen call (b920f9c)
- Abstracted some methods from ray_benchmark into functions for use in tests (055eca8)
- Added test for filling ray buffers directly + ray_fire_prepared code path (c6c71fa)

Waqar-ukaea merged commit e758e69 into batch-query-api Jan 28, 2026

Waqar-ukaea merged 9 commits into batch-query-api from ray-population-callback
