[MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) #137010

chencha3 · 2025-04-23T16:11:17Z

Similar to vector ops, XeGPU ops need to be unrolled into smaller shapes such that they can be dispatched into a hardware instruction. This PR marks the initial phase of a series dedicated to incorporating unroll patterns for XeGPU operations. In this installment, we introduce patterns for the following operations:

createNd
updateNd
prefetchNd
loadNd
storeNd
dpas

github-actions · 2025-04-23T16:13:41Z

✅ With the latest revision this PR passed the C/C++ code formatter.

Garra1980 · 2025-04-30T18:59:22Z

This PR marks the initial phase of a series dedicated to incorporating unroll patterns for XeGPU operations.

Can you please add some justification/explanation regarding those unroll patterns

fschlimb · 2025-05-07T15:25:49Z

mlir/lib/Dialect/XeGPU/Transforms/XeGPUUnroll.cpp

+    for (int64_t i = 0; i < mIters; i++) {
+      for (int64_t j = 0; j < nIters; j++) {
+        Value tmpC;
+        if (c)
+          tmpC = cVals[i * nIters + j]; // init with acc
+        for (int64_t k = 0; k < kIters; k++) {


LLVM coding standards want preincrements (++i): https://llvm.org/docs/CodingStandards.html#prefer-preincrement

Thanks! Fixed.

fschlimb · 2025-05-07T15:31:14Z

mlir/lib/Dialect/XeGPU/Transforms/XeGPUUnroll.cpp

+    auto targetShape = *maybeTargetShape;
+
+    auto convertedTdescTypes = getUnrolledTypes(tdescTy, targetShape);
+    auto convertedTdesc =


Use explicit type instead of auto: https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable

Many places in this code might be more readable with explicit types.

Thanks. Fixed.

fschlimb · 2025-05-07T15:35:56Z

mlir/lib/Dialect/XeGPU/Transforms/XeGPUUnroll.cpp

+    if (any_of(ranges, [](auto &v) { return v.size() == 0; }) ||
+        all_of(ranges, [](auto &v) { return v.size() == 1; })) {
+      return failure();
+    }


No {} for single statement if-body: https://llvm.org/docs/CodingStandards.html#don-t-use-braces-on-simple-single-statement-bodies-of-if-else-loop-statements

Thanks! Fixed.

fschlimb · 2025-05-07T15:40:03Z

mlir/test/lib/Dialect/XeGPU/TestXeGPUTransforms.cpp

+            }
+          }
+
+          if (isa<xegpu::DpasOp>(op)) {


thanks! fixed.

fschlimb · 2025-05-07T15:46:38Z

Added a few minor comments. On the monkey-level this looks good to me.
You might want to capitalize the first words in paragraphs and enums/bullet lists within comments.

chencha3 · 2025-05-07T20:32:04Z

Added a few minor comments. On the monkey-level this looks good to me. You might want to capitalize the first words in paragraphs and enums/bullet lists within comments.

Thanks @fschlimb, I made the changes according to your feedback. I hope I addressed all of your concerns.

charithaintc

LGTM

adam-smnk

It'll be interesting to later see if we could generalize and reuse vector unrolling to achieve the same. For now, I think it's a good addition to xegpu infrastructure and we'll see in practice how it holds up.

I take it depends on #138701?

mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp

mlir/test/lib/Dialect/XeGPU/CMakeLists.txt

chencha3 · 2025-05-08T17:27:05Z

It'll be interesting to later see if we could generalize and reuse vector unrolling to achieve the same. For now, I think it's a good addition to xegpu infrastructure and we'll see in practice how it holds up.

I take it depends on #138701?

Yeah. these patterns are supposed to be companions to vector unrolling patterns. They share the same idea, one is for XeGPU ops only, and one is for vector ops. A pass are supposed to use both of them.

Garra1980 · 2025-05-08T21:16:40Z

mlir/lib/Dialect/XeGPU/Transforms/XeGPUUnroll.cpp

+  }
+
+private:
+  const char *const packAttrName = "__xetile_blocking_pack__";


xetile->xegpu I guess, here and in tests

good catch. fixed it

Garra1980 · 2025-05-08T21:19:02Z

mlir/include/mlir/Dialect/XeGPU/Transforms/Transforms.h

+/// provide a way to customize the native shape of the operation.
+struct UnrollOptions {
+  using FilterConstraintFnType = std::function<LogicalResult(Operation *op)>;
+  /// Callback function that indicates whether vector unrolling should be


nit: let's place this comment above "using" to have uniform look :)

chencha3 · 2025-05-09T14:50:36Z

It'll be interesting to later see if we could generalize and reuse vector unrolling to achieve the same. For now, I think it's a good addition to xegpu infrastructure and we'll see in practice how it holds up.

I take it depends on #138701?

#138701 has been merged, and this PR is rebased too.

chencha3 · 2025-05-09T16:31:47Z

Hi @fschlimb and @adam-smnk, do you have more suggestions?

fschlimb · 2025-05-09T16:45:35Z

Hi @fschlimb and @adam-smnk, do you have more suggestions?

no, LGTM!

chencha3 · 2025-05-12T14:16:02Z

Hi @adam-smnk, I am going to merge this first. If you have more suggestions, feel free to let me know. Thanks for your help!

chencha3 added 6 commits April 17, 2025 17:54

init

7d332da

Merge branch 'main' into xegpu_unroll_patterns

d4549ad

Merge branch 'main' into xegpu_unroll_patterns

cdd5059

add patterns for createNdOp and StoreNdOp

47f9b3d

refine nativeShapeFn

932747e

refine verifier for TensorDescType

f843d98

chencha3 added 5 commits April 23, 2025 18:29

add loadNd pattern

c6bdd3c

add test pass

1d4dc72

format code

545f937

add unit test

008dbc7

clean up

d077cb0

chencha3 mentioned this pull request Apr 24, 2025

[mlir][xegpu] SIMT distribution patterns for XeGPU CreateNdTdesc, LoadNd, StoreNd and Dpas Ops. #135271

Merged

chencha3 added 8 commits April 28, 2025 18:53

stage

0193a04

Merge branch 'main' into xegpu_unroll_patterns

7f8b00a

add dpas pattern and unit test

456465e

refactor

906d699

fix format

c63a496

fix format

e2ed1ac

refine

35b35f0

refine

6fef430

chencha3 marked this pull request as ready for review April 30, 2025 16:16

chencha3 changed the title ~~[MLIR][XeGPU] Add unroll pass for XeGPU~~ [MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) Apr 30, 2025

chencha3 added 3 commits April 30, 2025 17:53

cleanup and add patterns for rest nd ops

9d24920

fix format

1a92661

cleanup

0126eb9

chencha3 requested review from adam-smnk, charithaintc and fschlimb May 5, 2025 16:04

fschlimb reviewed May 7, 2025

View reviewed changes

switch to explicit types

e873d59

chencha3 force-pushed the users/chencha3/xegpu/xegpu_unroll_patterns branch from e24d75f to e873d59 Compare May 7, 2025 20:29

clean up

b55f43b

charithaintc approved these changes May 8, 2025

View reviewed changes

adam-smnk reviewed May 8, 2025

View reviewed changes

mlir/lib/Dialect/XeGPU/IR/XeGPUOps.cpp Outdated Show resolved Hide resolved

mlir/test/lib/Dialect/XeGPU/CMakeLists.txt Outdated Show resolved Hide resolved

move getUnrolledTypes out

383bd1d

chencha3 added 5 commits May 8, 2025 18:26

addressed comments

4fc35cf

address comments

536a610

fix format

39ca440

Merge branch 'main' into xegpu_unroll_patterns

09cec0b

sync

1d3d12c

Garra1980 reviewed May 8, 2025

View reviewed changes

chencha3 added 2 commits May 8, 2025 21:36

address comments

96cb62b

Merge branch 'main' into xegpu_unroll_patterns

163204a

update cmake

1caac76

chencha3 merged commit db42345 into main May 12, 2025
11 checks passed

chencha3 deleted the users/chencha3/xegpu/xegpu_unroll_patterns branch May 12, 2025 14:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) #137010

[MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) #137010

chencha3 commented Apr 23, 2025 •

edited

Loading

github-actions bot commented Apr 23, 2025 •

edited

Loading

Garra1980 commented Apr 30, 2025

fschlimb May 7, 2025

chencha3 May 8, 2025

fschlimb May 7, 2025

fschlimb May 7, 2025 •

edited

Loading

chencha3 May 8, 2025

fschlimb May 7, 2025

chencha3 May 8, 2025

fschlimb May 7, 2025

chencha3 May 8, 2025

fschlimb commented May 7, 2025

chencha3 commented May 7, 2025

charithaintc left a comment

adam-smnk left a comment

chencha3 commented May 8, 2025

Garra1980 May 8, 2025

chencha3 May 8, 2025

Garra1980 May 8, 2025

chencha3 May 8, 2025

chencha3 commented May 9, 2025

chencha3 commented May 9, 2025

fschlimb commented May 9, 2025

chencha3 commented May 12, 2025

[MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) #137010

[MLIR][XeGPU] Add unroll patterns for XeGPU (1/N) #137010

Conversation

chencha3 commented Apr 23, 2025 • edited Loading

github-actions bot commented Apr 23, 2025 • edited Loading

Garra1980 commented Apr 30, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fschlimb May 7, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fschlimb commented May 7, 2025

chencha3 commented May 7, 2025

charithaintc left a comment

Choose a reason for hiding this comment

adam-smnk left a comment

Choose a reason for hiding this comment

chencha3 commented May 8, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chencha3 commented May 9, 2025

chencha3 commented May 9, 2025

fschlimb commented May 9, 2025

chencha3 commented May 12, 2025

chencha3 commented Apr 23, 2025 •

edited

Loading

github-actions bot commented Apr 23, 2025 •

edited

Loading

fschlimb May 7, 2025 •

edited

Loading