-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Support NVIDIA 50 series GPU #6120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -28,13 +28,14 @@ namespace { | |
| static int getMMAVersionSafe(int computeCapability, DotOp op) { | ||
| // List supported mma version in order of preference. | ||
| SmallVector<int> versionsSupported; | ||
| op.emitRemark() << "Your computeCapability is " << computeCapability; | ||
| if (computeCapability < 75) { | ||
| versionsSupported = {1}; | ||
| } else if (computeCapability < 90) { | ||
| versionsSupported = {2}; | ||
| } else if (computeCapability < 100) { | ||
| versionsSupported = {3, 2}; | ||
| } else if (computeCapability < 110) { | ||
| } else if (computeCapability < 130) { | ||
| versionsSupported = {5, 2}; | ||
|
Collaborator
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Consumer cards only support Ampere-style mma. So only the version 2 is supported.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If you still get an error go to AccelerateMatmul.cpp and just delete the 5 and leave it like: As @masahi pointed, consumer RTX 5090 only supports version 2.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Deleting the 110 branch is wrong. Need to add a 130 branch with only version {2}. |
||
| } else { | ||
| assert(false && "computeCapability not supported"); | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please remove this to avoid noise.