Skip to content

Conversation

@JiamingMai
Copy link

This PR fixes the issue that it failed to compile with NVIDIA 5090. As the computeCapability of some 50 series GPU is 120 which is greater than 110, this PR updates the condition to avoid exception.

New contributor declaration

  • I am not making a trivial change, such as fixing a typo in a comment.

  • I have written a PR description following these
    rules.

  • I have run pre-commit run --from-ref origin/main --to-ref HEAD.

  • Select one of the following.

    • I have added tests.
      • /test for lit tests
      • /unittest for C++ tests
      • /python/test for end-to-end tests
    • This PR does not need a test because FILL THIS IN.
  • Select one of the following.

    • I have not added any lit tests.
    • The lit tests I have added follow these best practices,
      including the "tests should be minimal" section. (Usually running Python code
      and using the instructions it generates is not minimal.)

@JiamingMai JiamingMai requested a review from ptillet as a code owner March 6, 2025 07:10
versionsSupported = {3, 2};
} else if (computeCapability < 110) {
} else if (computeCapability < 130) {
versionsSupported = {5, 2};
Copy link
Collaborator

@masahi masahi Mar 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consumer cards only support Ampere-style mma. So only the version 2 is supported.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you still get an error go to AccelerateMatmul.cpp and just delete the 5 and leave it like:

  } else if (computeCapability < 130) {
    versionsSupported = {2};

As @masahi pointed, consumer RTX 5090 only supports version 2.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Deleting the 110 branch is wrong. Need to add a 130 branch with only version {2}.

static int getMMAVersionSafe(int computeCapability, DotOp op) {
// List supported mma version in order of preference.
SmallVector<int> versionsSupported;
op.emitRemark() << "Your computeCapability is " << computeCapability;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please remove this to avoid noise.

@Jokeren
Copy link
Contributor

Jokeren commented Mar 6, 2025

Close as it has been addressed in #6131 (review)

@Jokeren Jokeren closed this Mar 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants