Fix to compile FIL infer_kernel with CUDA arch 1210 #6821

Merged
raydouglass merged 1 commit into rapidsai:branch-25.06 from viclafargue:fil-infer-gb10-fix
May 30, 2025
Conversation

@viclafargue (Contributor) commented May 30, 2025

infer_kernel falls back to a MAX_THREADS_PER_SM value of 2048 for every GPU architecture that is not listed explicitly. This PR adds sm121 (CUDA arch 1210) to the list of architectures whose MAX_THREADS_PER_SM value is 1024.
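The selection logic described above can be sketched roughly as follows. This is an illustration, not the cuML source: the function name `max_threads_per_sm` is hypothetical, the real code may well use preprocessor checks on `__CUDA_ARCH__` instead, and only the sm121 entry is taken from this PR. The other architectures already on the 1024 list are left out rather than guessed.

```cpp
// Illustrative sketch only; not the cuML implementation.
// `max_threads_per_sm` is a hypothetical helper that maps a
// __CUDA_ARCH__-style value to the per-SM thread budget.
constexpr int max_threads_per_sm(int cuda_arch)
{
  // 1210 corresponds to sm121, the architecture this PR adds.
  // Other explicitly listed architectures would be checked here as well.
  return cuda_arch == 1210 ? 1024   // listed explicitly
                           : 2048;  // fallback for everything not listed
}

// Compile-time checks of the two paths. 9999 is a made-up value that
// stands in for any architecture missing from the list.
static_assert(max_threads_per_sm(1210) == 1024, "sm121 is listed explicitly");
static_assert(max_threads_per_sm(9999) == 2048, "unlisted values use the fallback");
```

Before this change, sm121 would have taken the fallback branch and received 2048, which is the situation the PR corrects.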

@viclafargue requested a review from a team as a code owner May 30, 2025 10:30
@viclafargue requested review from dantegd and lowener May 30, 2025 10:30
@viclafargue requested a review from hcho3 May 30, 2025 10:30
@viclafargue changed the base branch from branch-25.08 to branch-25.06 May 30, 2025 13:59
@csadorf added the "! - Hotfix" label (Hotfix is a bug that affects the majority of users for which there is no reasonable workaround) May 30, 2025
@csadorf requested a review from divyegala May 30, 2025 18:09
@divyegala added the "bug" (Something isn't working) and "non-breaking" (Non-breaking change) labels May 30, 2025
@raydouglass merged commit 1c2ba13 into rapidsai:branch-25.06 May 30, 2025
108 of 109 checks passed
Labels

! - Hotfix: Hotfix is a bug that affects the majority of users for which there is no reasonable workaround
bug: Something isn't working
CUDA/C++
non-breaking: Non-breaking change

5 participants