Skip to content
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -28,13 +28,14 @@ namespace {
static int getMMAVersionSafe(int computeCapability, DotOp op) {
// List supported mma version in order of preference.
SmallVector<int> versionsSupported;
op.emitRemark() << "Your computeCapability is " << computeCapability;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please remove this to avoid noise.

if (computeCapability < 75) {
versionsSupported = {1};
} else if (computeCapability < 90) {
versionsSupported = {2};
} else if (computeCapability < 100) {
versionsSupported = {3, 2};
} else if (computeCapability < 110) {
} else if (computeCapability < 130) {
versionsSupported = {5, 2};
Copy link
Collaborator

@masahi masahi Mar 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consumer cards only support Ampere-style mma. So only the version 2 is supported.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you still get an error go to AccelerateMatmul.cpp and just delete the 5 and leave it like:

  } else if (computeCapability < 130) {
    versionsSupported = {2};

As @masahi pointed, consumer RTX 5090 only supports version 2.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Deleting the 110 branch is wrong. Need to add a 130 branch with only version {2}.

} else {
assert(false && "computeCapability not supported");
Expand Down