[3.3.1 cherry pick] Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 (#6131) #6771

davidberard98 · 2025-05-09T16:01:08Z

This PR addresses a compilation issue when targeting RTX 5090 GPUs with compute capability 120. Previously, the function getMMAVersionSafe would trigger an assertion failure for compute capabilities beyond 110. This update ensures that GPUs with compute capability of 120 fall under a valid MMA version category, preventing unnecessary assertion failures.

Changes
Updated the compute capability check in getMMAVersionSafe to handle GPUs with compute capability up to 129.
Assigned MMA version {2} for these cases to maintain compatibility with customer's gpus only supporting version 2.

Motivation
Certain GPUs, including models from the NVIDIA 50 series, have a compute capability of 120, which was previously unhandled, causing compilation failures. This fix ensures compatibility with such GPUs.

New contributor declaration

I am not making a trivial change, such as fixing a typo in a comment.
[x ] I have written a PR description following these rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because FILL THIS IN.
Select one of the following.
- I have not added any lit tests.
The lit tests I have added follow these best practices, including the "tests should be minimal" section. (Usually running Python code
and using the instructions it generates is not minimal.)

New contributor declaration

I am not making a trivial change, such as fixing a typo in a comment.
I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because FILL THIS IN.
Select one of the following.
- I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

…triton-lang#6131) This PR addresses a compilation issue when targeting RTX 5090 GPUs with compute capability 120. Previously, the function getMMAVersionSafe would trigger an assertion failure for compute capabilities beyond 110. This update ensures that GPUs with compute capability of 120 fall under a valid MMA version category, preventing unnecessary assertion failures. Changes Updated the compute capability check in getMMAVersionSafe to handle GPUs with compute capability up to 129. Assigned MMA version {2} for these cases to maintain compatibility with customer's gpus only supporting version 2. Motivation Certain GPUs, including models from the NVIDIA 50 series, have a compute capability of 120, which was previously unhandled, causing compilation failures. This fix ensures compatibility with such GPUs.  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x ] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [ ] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [ ] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [x] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)

davidberard98 · 2025-05-09T16:01:18Z

cc @atalman

atalman

lgtm

Triton is pointing to latest triton pin : https://github.com/triton-lang/triton/tree/release/3.3.x XPU pointing to latest XPU pin: https://github.com/intel/intel-xpu-backend-for-triton/commits/release/3.3.x/ This version contains the fix for: Compilation Issue for RTX 5090 GPUs with Compute Capability = 120. triton-lang/triton#6771 Pull Request resolved: #153951 Approved by: https://github.com/davidberard98

davidberard98 marked this pull request as ready for review May 9, 2025 16:01

davidberard98 requested a review from ptillet as a code owner May 9, 2025 16:01

atalman mentioned this pull request May 13, 2025

[v.3.3.1] Release Tracker #6805

Closed

atalman approved these changes May 13, 2025

View reviewed changes

Jokeren approved these changes May 13, 2025

View reviewed changes

atalman merged commit b79de50 into triton-lang:release/3.3.x May 13, 2025

woct0rdho mentioned this pull request May 19, 2025

Assertion `false && "computeCapability not supported"' failed. #6859

Closed

atalman mentioned this pull request May 20, 2025

Bump triton pin for the release 3.3.1 of triton pytorch/pytorch#153951

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[3.3.1 cherry pick] Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 (#6131) #6771

[3.3.1 cherry pick] Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 (#6131) #6771

Uh oh!

davidberard98 commented May 9, 2025

Uh oh!

davidberard98 commented May 9, 2025

Uh oh!

atalman left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[3.3.1 cherry pick] Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 (#6131) #6771

[3.3.1 cherry pick] Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 (#6131) #6771

Uh oh!

Conversation

davidberard98 commented May 9, 2025

New contributor declaration

New contributor declaration

Uh oh!

davidberard98 commented May 9, 2025

Uh oh!

atalman left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants