Skip to content

Commit 33a4498

Browse files
zhewenldevpatelio
authored andcommitted
[CI/Build] Update checking logic in cutlass_group_gemm_supported (vllm-project#27948)
Signed-off-by: zhewenli <[email protected]>
1 parent ff4f8a8 commit 33a4498

File tree

1 file changed

+5
-1
lines changed

1 file changed

+5
-1
lines changed

vllm/_custom_ops.py

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -836,7 +836,11 @@ def cutlass_sparse_scaled_mm_supported(cuda_device_capability: int) -> bool:
836836

837837

838838
def cutlass_group_gemm_supported(cuda_device_capability: int) -> bool:
839-
return torch.ops._C.cutlass_group_gemm_supported(cuda_device_capability)
839+
try:
840+
return torch.ops._C.cutlass_group_gemm_supported(cuda_device_capability)
841+
except AttributeError:
842+
# Return False on non-CUDA platforms where it is not available
843+
return False
840844

841845

842846
def cutlass_sparse_compress(a: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:

0 commit comments

Comments
 (0)