Skip to content

Conversation

@gshtras
Copy link
Collaborator

@gshtras gshtras commented Jul 31, 2025

Porting over vllm-project#20330 and vllm-project#21205

Commits included:

Using cuda_compat to defint the WARP_SIZE once

On ROCm toe constant compile time warp size can not be used on the host side that can be shared for multiple architectures with different values
Formatting
Refactor to use cuda_compat, and not the unhippified version
Leaving CUDA side as just a simple define

Commits included:

Using cuda_compat to defint the WARP_SIZE once

Signed-off-by: Gregory Shtrasberg <[email protected]>

On ROCm toe constant compile time warp size can not be used on the host side that can be shared for multiple architectures with different values

Signed-off-by: Gregory Shtrasberg <[email protected]>

Formatting

Signed-off-by: Gregory Shtrasberg <[email protected]>

Refactor to use cuda_compat, and not the unhippified version

Signed-off-by: Gregory Shtrasberg <[email protected]>

Leaving CUDA side as just a simple define

Signed-off-by: Gregory Shtrasberg <[email protected]>
@gshtras gshtras merged commit 5485f8c into ROCm-7.0 Jul 31, 2025
2 of 5 checks passed
@gshtras gshtras deleted the warp_size_fix_rocm-7.0 branch July 31, 2025 15:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants