Use macro guard CUDA functions for back compatibility in grouped_topk_kernel.cu #25346
Conversation
@minosfuture has exported this pull request. If you are a Meta employee, you can view the originating diff in D82918389.
Code Review
The pull request introduces a compatibility wrapper is_finite to handle differences in isfinite support across CUDA versions. This is a good approach to maintain backward compatibility. My feedback focuses on improving the implementation of this new function for better robustness and readability.
template <typename T>
__device__ inline bool is_finite(const T val) {
#if (__CUDACC_VER_MAJOR__ * 10000 + __CUDACC_VER_MINOR__ * 100 >= 120800)
  bool res = cuda::std::isfinite(val);
#else
  bool res = isfinite(cuda_cast<float, T>(val));
#endif
  return res;
}
The implementation of is_finite can be improved for robustness and conciseness.
- The preprocessor check for the CUDA version can be made more robust and standard by using the __CUDACC_VER__ macro. This macro combines major, minor, and patch versions into a single integer (e.g., 120800 for 12.8.0), which makes the check cleaner and correctly handles patch versions if needed in the future. The current check only considers major and minor versions.
- The function body can be simplified by directly returning the result of the isfinite calls within the #if/#else branches, removing the need for the intermediate res variable.
Here is a suggested improved version:
template <typename T>
__device__ inline bool is_finite(const T val) {
#if defined(__CUDACC_VER__) && __CUDACC_VER__ >= 120800
return cuda::std::isfinite(val);
#else
return isfinite(cuda_cast<float, T>(val));
#endif
}
__CUDACC_VER__ should be deprecated, right?
the training data of gemini needs some refreshing :D
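For reference, __CUDACC_VER__ is no longer defined by nvcc; the supported macros are __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__. Below is a minimal sketch (not part of the PR diff) of a guard built only from those macros, mirroring the threshold the PR uses; the HAS_CUDA_STD_ISFINITE flag name is hypothetical and used only for this illustration.

// Sketch: combine the split nvcc version macros by hand, as the PR's guard does.
#if defined(__CUDACC_VER_MAJOR__) && defined(__CUDACC_VER_MINOR__)
  #if (__CUDACC_VER_MAJOR__ * 10000 + __CUDACC_VER_MINOR__ * 100) >= 120800
    #define HAS_CUDA_STD_ISFINITE 1  // hypothetical feature flag, CUDA 12.8+
  #endif
#endif

#ifdef HAS_CUDA_STD_ISFINITE
  // cuda::std::isfinite path
#else
  // cast-to-float + ::isfinite fallback
#endif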
Force-pushed from da22c1c to c68fe6f.
@minosfuture has exported this pull request. If you are a Meta employee, you can view the originating diff in D82918389.
Force-pushed from 3b5ebfd to ccabe13.
yewentao256 left a comment:
LGTM, thanks for the work!
Summary:
cuda::std::isfinite is not available in earlier CUDA versions. We guard it
with macros and extract a device function, is_finite.
Test Plan: build with CUDA 12.4 and 12.8
Reviewed By: houseroad
Differential Revision: D82918389
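For illustration, here is a self-contained sketch of how the guarded helper can be exercised outside the vLLM tree. The sanitize_scores kernel and the plain static_cast fallback are assumptions made only for this example; the real helper lives in grouped_topk_kernel.cu and casts through vLLM's cuda_cast<float, T>.

#include <cuda_runtime.h>
#if (__CUDACC_VER_MAJOR__ * 10000 + __CUDACC_VER_MINOR__ * 100 >= 120800)
#include <cuda/std/cmath>  // provides cuda::std::isfinite on CUDA 12.8+
#endif

// Standalone variant of the guarded helper from the PR.
template <typename T>
__device__ inline bool is_finite(const T val) {
#if (__CUDACC_VER_MAJOR__ * 10000 + __CUDACC_VER_MINOR__ * 100 >= 120800)
  return cuda::std::isfinite(val);
#else
  return isfinite(static_cast<float>(val));  // float overload of ::isfinite
#endif
}

// Hypothetical kernel: zero out non-finite scores before a top-k pass.
__global__ void sanitize_scores(float* scores, int n) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n && !is_finite(scores[i])) {
    scores[i] = 0.0f;
  }
}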