[mlir][ROCDL] Remove unneeded bf16 expansion in LowerGPUToROCDL #139603
The umbrella pass for lowering GPU ops to ROCDL (aka lowering to LLVM + the AMDGPU-specific setup) would call the arith patterns that manually implemented extf and truncf on bfloat because the LLVM AMDGPU backend used to not support those operations. Since the backend now supports these operations and has for quite some time, remove these patterns from the default lowering flow.
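For readers unfamiliar with what a manual bf16 expansion involves, here is a minimal standalone C++ sketch of the general technique: extf as a 16-bit left shift, and truncf as round-to-nearest-even on the float bits. The helper names (floatBits, bitsToFloat, extendBF16, truncateToBF16, checkRoundTrip) are hypothetical, and this illustrates the bit-level idea only. It is not the code emitted by arith::populateExpandBFloat16Patterns, whose exact rounding and NaN handling may differ.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helpers: reinterpret a float as its IEEE-754 binary32 bits and back.
uint32_t floatBits(float f) {
  uint32_t u;
  std::memcpy(&u, &f, sizeof u);
  return u;
}

float bitsToFloat(uint32_t u) {
  float f;
  std::memcpy(&f, &u, sizeof f);
  return f;
}

// extf bf16 -> f32: a bf16 value is the upper 16 bits of a binary32 value,
// so widening is an exact left shift by 16.
float extendBF16(uint16_t b) {
  return bitsToFloat(static_cast<uint32_t>(b) << 16);
}

// truncf f32 -> bf16 with round-to-nearest-even (an assumption of this sketch).
uint16_t truncateToBF16(float f) {
  uint32_t bits = floatBits(f);
  // NaN: keep the top bits and force a quiet-NaN mantissa bit so the
  // result cannot collapse to infinity.
  if ((bits & 0x7fffffffu) > 0x7f800000u)
    return static_cast<uint16_t>((bits >> 16) | 0x0040u);
  // Add 0x7fff plus the lowest kept bit: ties round to the even neighbor.
  uint32_t lsb = (bits >> 16) & 1u;
  bits += 0x7fffu + lsb;
  return static_cast<uint16_t>(bits >> 16);
}

// Small usage check: values exactly representable in bf16 round-trip unchanged.
void checkRoundTrip() {
  assert(extendBF16(truncateToBF16(1.0f)) == 1.0f);
  assert(extendBF16(truncateToBF16(-2.5f)) == -2.5f);
}
```

When the backend can select these conversions natively, an expansion of this kind in the lowering pass is redundant, which is the motivation stated for this change.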
@llvm/pr-subscribers-mlir Author: Krzysztof Drewniak (krzysz00)
Full diff: https://github.com/llvm/llvm-project/pull/139603.diff
1 Files Affected:
diff --git a/mlir/lib/Conversion/GPUToROCDL/LowerGpuOpsToROCDLOps.cpp b/mlir/lib/Conversion/GPUToROCDL/LowerGpuOpsToROCDLOps.cpp
index dd16ec4b73e9f..c52bf505de4a5 100644
--- a/mlir/lib/Conversion/GPUToROCDL/LowerGpuOpsToROCDLOps.cpp
+++ b/mlir/lib/Conversion/GPUToROCDL/LowerGpuOpsToROCDLOps.cpp
@@ -319,7 +319,6 @@ struct LowerGpuOpsToROCDLOpsPass final
{
RewritePatternSet patterns(ctx);
populateGpuRewritePatterns(patterns);
- arith::populateExpandBFloat16Patterns(patterns);
(void)applyPatternsGreedily(m, std::move(patterns));
}
@@ -319,7 +319,6 @@ struct LowerGpuOpsToROCDLOpsPass final
{
RewritePatternSet patterns(ctx);
populateGpuRewritePatterns(patterns);
- arith::populateExpandBFloat16Patterns(patterns);
Looks good to me, but it looks like I don't have "approval" access.