Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 #6131

oteroantoniogom · 2025-03-06T20:07:28Z

This PR addresses a compilation issue when targeting RTX 5090 GPUs with compute capability 120. Previously, the function getMMAVersionSafe would trigger an assertion failure for compute capabilities beyond 110. This update ensures that GPUs with compute capability of 120 fall under a valid MMA version category, preventing unnecessary assertion failures.

Changes
Updated the compute capability check in getMMAVersionSafe to handle GPUs with compute capability up to 129.
Assigned MMA version {2} for these cases to maintain compatibility with customer's gpus only supporting version 2.

Motivation
Certain GPUs, including models from the NVIDIA 50 series, have a compute capability of 120, which was previously unhandled, causing compilation failures. This fix ensures compatibility with such GPUs.

New contributor declaration

I am not making a trivial change, such as fixing a typo in a comment.
[x ] I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because FILL THIS IN.
Select one of the following.
- I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

…pports only version 2

Jokeren · 2025-03-06T20:39:33Z

The 110 branch shouldn't be deleted. If you can update the PR I'll close #6120

…ble values

oteroantoniogom · 2025-03-06T20:43:34Z

The 110 branch shouldn't be deleted. If you can update the PR I'll close #6120

Done

…triton-lang#6131) This PR addresses a compilation issue when targeting RTX 5090 GPUs with compute capability 120. Previously, the function getMMAVersionSafe would trigger an assertion failure for compute capabilities beyond 110. This update ensures that GPUs with compute capability of 120 fall under a valid MMA version category, preventing unnecessary assertion failures. Changes Updated the compute capability check in getMMAVersionSafe to handle GPUs with compute capability up to 129. Assigned MMA version {2} for these cases to maintain compatibility with customer's gpus only supporting version 2. Motivation Certain GPUs, including models from the NVIDIA 50 series, have a compute capability of 120, which was previously unhandled, causing compilation failures. This fix ensures compatibility with such GPUs.  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x ] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [ ] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [ ] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [x] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)

…ute Capability = 120 (#6131) (#6771) This PR addresses a compilation issue when targeting RTX 5090 GPUs with compute capability 120. Previously, the function getMMAVersionSafe would trigger an assertion failure for compute capabilities beyond 110. This update ensures that GPUs with compute capability of 120 fall under a valid MMA version category, preventing unnecessary assertion failures. Changes Updated the compute capability check in getMMAVersionSafe to handle GPUs with compute capability up to 129. Assigned MMA version {2} for these cases to maintain compatibility with customer's gpus only supporting version 2. Motivation Certain GPUs, including models from the NVIDIA 50 series, have a compute capability of 120, which was previously unhandled, causing compilation failures. This fix ensures compatibility with such GPUs.  # New contributor declaration - [x] I am not making a trivial change, such as fixing a typo in a comment. - [x ] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [ ] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [ ] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [x] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [x] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)  # New contributor declaration - [ ] I am not making a trivial change, such as fixing a typo in a comment. - [ ] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how). - [ ] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`. - Select one of the following. - [ ] I have added tests. - `/test` for `lit` tests - `/unittest` for C++ tests - `/python/test` for end-to-end tests - [ ] This PR does not need a test because `FILL THIS IN`. - Select one of the following. - [ ] I have not added any `lit` tests. - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.) Co-authored-by: Antonio Gómez <[email protected]>

fix:added support for sm_120 and adapted to customer nvidia, which su…

ed48270

…pports only version 2

oteroantoniogom requested a review from ptillet as a code owner March 6, 2025 20:07

fix:added 110 because deleting it is wrong as it is part of the possi…

18b7e65

…ble values

Jokeren approved these changes Mar 6, 2025

View reviewed changes

Jokeren mentioned this pull request Mar 6, 2025

Support NVIDIA 50 series GPU #6120

Closed

7 tasks

Jokeren approved these changes Mar 6, 2025

View reviewed changes

Jokeren merged commit f3bd7f7 into triton-lang:main Mar 6, 2025
8 checks passed

woct0rdho mentioned this pull request Mar 15, 2025

Failed to find Python libs woct0rdho/triton-windows#83

Closed

ExtReMLapin mentioned this pull request Apr 10, 2025

Consumer Blackwell GPUs not supported in 3.3 release #6447

Closed

atalman mentioned this pull request May 13, 2025

[v.3.3.1] Release Tracker #6805

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 #6131

Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 #6131

Uh oh!

oteroantoniogom commented Mar 6, 2025 •

edited

Loading

Uh oh!

Jokeren commented Mar 6, 2025

Uh oh!

oteroantoniogom commented Mar 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 #6131

Fix Compilation Issue for RTX 5090 GPUs with Compute Capability = 120 #6131

Uh oh!

Conversation

oteroantoniogom commented Mar 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

New contributor declaration

Uh oh!

Jokeren commented Mar 6, 2025

Uh oh!

oteroantoniogom commented Mar 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

oteroantoniogom commented Mar 6, 2025 •

edited

Loading