Implement `cudax::cufile` by davebayer · Pull Request #6122 · NVIDIA/cccl

davebayer · 2025-10-03T09:49:55Z

This PR implements wrappers of CUfileHandle_t and related APIs.

It introduces:

cudax::cufile type which handles opening/closing the native file handle and registering/deregistering of the cuFile file handle. It owns both of these resources.
cudax::cufile_ref type that is a lightweight non-owning wrapper for the cuFile file handle. It doesn't own the resources (with exception below).
cuda::cufile_driver.[de]register_native_handle(...) -> cuda::cufile_ref that can be used for manual registration of the native file handle. It has basically the same semantics as new/delete.

So, right now these are the scenarios are covered:

the user has some legacy code from where he gets the CUfileHandle_t -> he can use cuda::cufile_ref with our APIs.
the user has some code that gives him the native file handle, but he doesn't own the native file handle -> he can use cuda::cufile_driver.register_native_handle(...).
the user wants us to open the native file handle for him as well -> he can use cuda::cufile

We might introduce a third option: cuda::scoped_cufile or something similar, that would be constructible from cuda::cufile_ref and would deregister the handle when going out of scope, but I would wait with it.

copy-pr-bot · 2025-10-03T09:49:59Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

cudax/include/cuda/experimental/__cufile/cufile_ref.cuh

cudax/test/cufile/driver.cu

jvera-nvidia · 2025-10-16T14:59:19Z

cudax/include/cuda/experimental/__cufile/cufile.cuh

+  //! @brief Wrapper for retrieving the open mode.
+  [[nodiscard]] static _CCCL_HOST_API cufile_open_mode __open_mode(native_handle_type __native_handle)
+  {
+    int __oflags = ::fcntl(__native_handle, F_GETFL);


do we really need to retrieve the opening mode?
I'm a bit worried on non-standard things in the bindings for when eventually cufile is windows supported.

jvera-nvidia · 2025-10-16T15:50:42Z

cudax/include/cuda/experimental/__cufile/cufile.cuh

+  _CCCL_HOST_API void open(const char* __filename, cufile_open_mode __open_mode)
+  {
+    if (is_open())
+    {
+      ::cuda::std::__throw_runtime_error("File is already opened.");
+    }
+
+    __native_handle_ = __open_file(__filename, __make_oflags(__open_mode));
+
+    try
+    {
+      __cufile_handle_ = __register_cufile_handle(__native_handle_);
+    }
+    catch (...)
+    {
+      __close_file(::cuda::std::exchange(__native_handle_, __invalid_native_handle));
+      throw;
+    }
+  }


I'm not sure we need this... its two ways of doing the same (this vs constructor) and this actually is forbidden if the constructor was already called. I'd vote to remove this one.

But for example std::fstream also implements the .open(...) method. I'd like to stay as close as what is common in the standard as possible

but we can always implement it later if needed... I don't see why this is needed; we are not trying to stay close to fstream either, are we?

That's true, but I'd like to provide tools users are used to have. Consider this:

cuda::cufile file; // ... file = cuda::cufile{filename, open_mode}; // open the file // ... file = cuda::cufile{}; // close the file

You have a situation like this, when you want to have the object in not opened state. Then, to open the file, you must create another object and move assign it to the original object. And if you want to close the file with detection whether it was closed or not, you must (again) move assign a new default constructed object.

In my opinion this is not a good design. I'd like to do:

cuda::cufile file; // ... file.open(filename, open_mode); // open the file // ... file.close(); // close the file

cudax/include/cuda/experimental/__cufile/cufile.cuh

jvera-nvidia · 2025-10-16T20:12:54Z

Fair enough, those are valid points. I was unsure we needed it but as you said the easier of use and known methods are totally valid and I think it's good to have! Enviado desde Outlook para Android<https://aka.ms/AAb9ysg>

________________________________ From: David Bayer ***@***.***> Sent: Thursday, October 16, 2025 9:15:39 PM To: NVIDIA/cccl ***@***.***> Cc: Javier Vera ***@***.***>; Comment ***@***.***> Subject: Re: [NVIDIA/cccl] Implement `cudax::cufile` (PR #6122) @davebayer commented on this pull request.

________________________________ In cudax/include/cuda/experimental/__cufile/cufile.cuh<#6122 (comment)>:

+ _CCCL_HOST_API void open(const char* __filename, cufile_open_mode __open_mode)

+ { + if (is_open()) + { + ::cuda::std::__throw_runtime_error("File is already opened."); + } + + __native_handle_ = __open_file(__filename, __make_oflags(__open_mode)); + + try + { + __cufile_handle_ = __register_cufile_handle(__native_handle_); + } + catch (...) + { + __close_file(::cuda::std::exchange(__native_handle_, __invalid_native_handle)); + throw; + } + } That's true, but I'd like to provide tools users are used to have. Consider this: cuda::cufile file; // ... file = cuda::cufile{filename, open_mode}; // open the file // ... file = cuda::cufile{}; // close the file You have a situation like this, when you want to have the object in not opened state. Then, to open the file, you must create another object and move assign it to the original object. And if you want to close the file with detection whether it was closed or not, you must (again) move assign a new default constructed object. — Reply to this email directly, view it on GitHub<#6122 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BSUF2R623U6KWV3NPWPNYAL3X7VFXAVCNFSM6AAAAACIGCYCK2VHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZTGNBWGYZDKOJZGA>. You are receiving this because you commented.Message ID: ***@***.***>

pciolkosz · 2025-10-16T23:08:39Z

cudax/test/cufile/common.h

+
+void test_check_fd_is_valid(int fd)
+{
+  CUDAX_REQUIRE(fcntl(fd, F_GETFD) == 0);


Could correct flags be non-zero here? Internet says to do:
fcntl(fd, F_GETFD) != -1 || errno != EBADF;
For this use case probably just the first part is ok

Oh yeah, you are right

davebayer · 2025-10-23T08:47:36Z

Waiting for https://github.com/nv-gha-runners/roadmap/issues/318

davebayer · 2025-11-14T09:22:20Z

Waiting for https://github.com/nv-gha-runners/roadmap/issues/318

As discussed internally, we will just build the tests for now, we will run those tests once the support is added. I tested it locally with CUDA 12.9 and 13.0 and both compiled & ran fine.

cudax/CMakeLists.txt

cudax/test/CMakeLists.txt

github-actions · 2025-11-18T07:49:24Z

🥳 CI Workflow Results

🟩 Finished in 12h 12m: Pass: 100%/261 | Total: 3d 16h | Max: 1h 41m | Hits: 99%/379106

See results here.

github-project-automation bot moved this to Todo in CCCL Oct 3, 2025

github-project-automation bot added this to CCCL Oct 3, 2025

cccl-authenticator-app bot moved this from Todo to In Progress in CCCL Oct 3, 2025

pciolkosz reviewed Oct 3, 2025

View reviewed changes

cudax/include/cuda/experimental/__cufile/cufile_ref.cuh Show resolved Hide resolved

davebayer mentioned this pull request Oct 8, 2025

[FEA]: C++ bindings for cuFile #6149

Open

6 tasks

jvera-nvidia reviewed Oct 8, 2025

View reviewed changes

cudax/test/cufile/driver.cu Show resolved Hide resolved

davebayer self-assigned this Oct 8, 2025

davebayer force-pushed the cudax_cufile_cufile branch 2 times, most recently from ff5d648 to 123f2f5 Compare October 16, 2025 13:19

davebayer marked this pull request as ready for review October 16, 2025 13:20

davebayer requested a review from a team as a code owner October 16, 2025 13:20

davebayer requested a review from pciolkosz October 16, 2025 13:20

cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Oct 16, 2025

davebayer force-pushed the cudax_cufile_cufile branch 2 times, most recently from e94e43d to d61c94c Compare October 16, 2025 13:52

This comment has been minimized.

Sign in to view

jvera-nvidia reviewed Oct 16, 2025

View reviewed changes

cudax/include/cuda/experimental/__cufile/cufile.cuh Show resolved Hide resolved

This comment has been minimized.

Sign in to view

pciolkosz approved these changes Oct 17, 2025

View reviewed changes

This comment has been minimized.

Sign in to view

This was referenced Oct 17, 2025

Implement cudax::read #6283

Draft

Implement cudax::cufile_batch #6284

Draft

This comment has been minimized.

Sign in to view

davebayer force-pushed the cudax_cufile_cufile branch from acc2cf1 to f683170 Compare October 20, 2025 20:15

This comment has been minimized.

Sign in to view

Implement cudax::cufile and cudax::cufile_ref

4f0b244

davebayer force-pushed the cudax_cufile_cufile branch from f683170 to 4f0b244 Compare November 14, 2025 09:19

davebayer requested review from a team as code owners November 14, 2025 09:19

davebayer requested a review from alliepiper November 14, 2025 09:19

This comment has been minimized.

Sign in to view

alliepiper requested changes Nov 17, 2025

View reviewed changes

cudax/CMakeLists.txt Outdated Show resolved Hide resolved

cudax/test/CMakeLists.txt Outdated Show resolved Hide resolved

cudax/test/CMakeLists.txt Show resolved Hide resolved

github-project-automation bot moved this from In Review to In Progress in CCCL Nov 17, 2025

fix review

3e33528

davebayer requested a review from alliepiper November 17, 2025 19:50

alliepiper approved these changes Nov 17, 2025

View reviewed changes

github-project-automation bot moved this from In Progress to In Review in CCCL Nov 17, 2025

This comment has been minimized.

Sign in to view

davebayer enabled auto-merge (squash) November 18, 2025 07:41

davebayer merged commit 006fa90 into NVIDIA:main Nov 18, 2025
807 of 818 checks passed

github-project-automation bot moved this from In Review to Done in CCCL Nov 18, 2025

Conversation

davebayer commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

copy-pr-bot bot commented Oct 3, 2025

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

jvera-nvidia Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

jvera-nvidia Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

davebayer Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

jvera-nvidia Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

davebayer Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jvera-nvidia commented Oct 16, 2025 via email

Uh oh!

This comment has been minimized.

pciolkosz Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

davebayer Oct 17, 2025

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

davebayer commented Oct 23, 2025

Uh oh!

davebayer commented Nov 14, 2025

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Nov 18, 2025

🥳 CI Workflow Results

🟩 Finished in 12h 12m: Pass: 100%/261 | Total: 3d 16h | Max: 1h 41m | Hits: 99%/379106

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

davebayer commented Oct 3, 2025 •

edited

Loading

davebayer Oct 16, 2025 •

edited

Loading