-
Notifications
You must be signed in to change notification settings - Fork 246
Open
Labels
feature requestNew feature or requestNew feature or request
Description
Is your feature request related to a problem? Please describe.
Currently when calling rmm::device_uvector::element we always copies to pageable memory before producing a host scalar value. The use of pageable memory can introduce additional performance overhead that users may wish to avoid. This overhead could be avoided if the data could be copied into pinned memory instead of pageable memory during the cuda mempcy operation.
Describe the solution you'd like
device_uvector should allow specifying a host mr on construction in addition to a device mr. The host mr would be used to allocate memory when D2H copies are made by the internals of the device_uvector code.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or request
Type
Projects
Status
To-do