Skip to content

Support setting kernel block cluster dimensions #484

@eyalroz

Description

@eyalroz

With the Hopper architecture, NVIDIA has introduced "clusters" of blocks which can use each other's shared memory. The clustering can be set either using a __cluster_dims__(1,2,3) qualifier in the kernel's signature, or at run-time. We need to support the run-time setting within our launch_configuration_t class and in the launch config builder mechanism.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions