-
Notifications
You must be signed in to change notification settings - Fork 499
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
PR #22593: [XLA:GPU] Fix triton dot op on sm120 (RTX50xx)
Imported from GitHub PR #22593 Triton doesn't currently support sm120 GPUs - adding a patch to fix that, the upstream support should be available soon. Converting "12.0" arch into "10.0" is not correct, as they're not compatible - removing this. Copybara import of the project: -- 5b5752d by Sergey Kozub <skozub@nvidia.com>: [XLA:GPU] Fix triton dot op on sm120 (RTX50xx) Merging this change closes #22593 FUTURE_COPYBARA_INTEGRATE_REVIEW=#22593 from openxla:skozub/sm120_dot 5b5752d PiperOrigin-RevId: 725968626
- Loading branch information
1 parent
6b470af
commit 1854cb5
Showing
3 changed files
with
14 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
diff --git a/lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp b/lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp | ||
index c66c9f4ae..3415d6a91 100644 | ||
--- a/lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp | ||
+++ b/lib/Dialect/TritonGPU/Transforms/AccelerateMatmul.cpp | ||
@@ -33,6 +33,8 @@ static int getMMAVersionSafe(int computeCapability, DotOp op) { | ||
versionsSupported = {3, 2}; | ||
} else if (computeCapability < 110) { | ||
versionsSupported = {5, 2}; | ||
+ } else if (computeCapability == 120) { | ||
+ versionsSupported = {2}; | ||
} else { | ||
assert(false && "computeCapability not supported"); | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters