[mlir][affine] Support vectorization with the step size exceeding 2^32-1 #66026

Lewuathe · 2023-09-11T23:07:04Z

Since the step size in affine.for is defined as int64_t, the vectorization pass should also use int64_t for the new step so that it keeps its full precision.
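To illustrate the truncation, here is a minimal standalone sketch using the values from the test added in this PR (scalar step 4294967295, vector size 128). The helper names `narrowStep`, `wideStep`, and `checkLargeStepExample` are hypothetical and are not part of SuperVectorize.cpp; the sketch also assumes that `unsigned` is 32 bits wide and that the product is computed in 64 bits before being stored.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: mimics storing the new step in a 32-bit `unsigned`,
// as the old `unsigned newStep` declaration would on typical platforms.
// The conversion is modular, so the upper 32 bits of the product are dropped.
uint32_t narrowStep(int64_t scalarStep, int64_t vectorSize) {
  return static_cast<uint32_t>(scalarStep * vectorSize);
}

// Hypothetical helper: mimics the patched `int64_t newStep` declaration,
// which keeps the full 64-bit product.
int64_t wideStep(int64_t scalarStep, int64_t vectorSize) {
  return scalarStep * vectorSize;
}

// Checks both variants against the values used in the new test case.
void checkLargeStepExample() {
  const int64_t scalarStep = 4294967295; // 2^32 - 1
  const int64_t vectorSize = 128;        // assumed vectorization factor
  // 2^32 - 128: the product wrapped around modulo 2^32.
  assert(narrowStep(scalarStep, vectorSize) == 4294967168u);
  // The value the CHECK line in the test expects.
  assert(wideStep(scalarStep, vectorSize) == 549755813760);
}
```

With a 32-bit step the vectorized loop would silently get a step of 4294967168 instead of 549755813760, which is the mismatch the one-line type change avoids.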

llvmbot · 2023-09-11T23:08:01Z

@llvm/pr-subscribers-mlir-affine

Changes

Since the step size in `affine.for` is defined as `int64_t`, the vectorization pass should also use `int64_t` for the new step so that it keeps its full precision.

Full diff: https://github.com/llvm/llvm-project/pull/66026.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Affine/Transforms/SuperVectorize.cpp (+1-1)
(modified) mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir (+13)

diff --git a/mlir/lib/Dialect/Affine/Transforms/SuperVectorize.cpp b/mlir/lib/Dialect/Affine/Transforms/SuperVectorize.cpp
index 072e858220feae3..222536756f9fa75 100644
--- a/mlir/lib/Dialect/Affine/Transforms/SuperVectorize.cpp
+++ b/mlir/lib/Dialect/Affine/Transforms/SuperVectorize.cpp
@@ -1291,7 +1291,7 @@ static Operation *vectorizeAffineForOp(AffineForOp forOp,
   // If we are vectorizing a vector dimension, compute a new step for the new
   // vectorized loop using the vectorization factor for the vector dimension.
   // Otherwise, propagate the step of the scalar loop.
-  unsigned newStep;
+  int64_t newStep;
   if (isLoopVecDim) {
     unsigned vectorDim = loopToVecDimIt->second;
     assert(vectorDim < strategy.vectorSizes.size() && "vector dim overflow");
diff --git a/mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir b/mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir
index 9244604128cb723..f4f056745c00e98 100644
--- a/mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir
+++ b/mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir
@@ -684,3 +684,16 @@ func.func @vec_vecdim_reduction_rejected(%in: memref<256x512xf32>, %out: memref<
 
 // CHECK-LABEL: @vec_vecdim_reduction_rejected
 // CHECK-NOT: vector
+
+// -----
+
+// CHECK-LABEL: @large_step_size
+// Support the step size exceeding 2^32-1.
+func.func @large_step_size(%A: memref<4294967295xf32>) {
+ %cst = arith.constant 0.000000e+00 : f32
+ // CHECK: affine.for %{{.*}} = 0 to 256 step 549755813760 {
+ affine.for %i = 0 to 256 step 4294967295 {
+   affine.store %cst, %A[%i] : memref<4294967295xf32>
+ }
+ return
+}

llvmbot · 2023-09-11T23:08:01Z

@llvm/pr-subscribers-mlir

llvmbot · 2023-09-11T23:08:03Z

@llvm/pr-subscribers-mlir-core

sergei-grechanik · 2023-09-22T16:04:42Z

mlir/test/Dialect/Affine/SuperVectorize/vectorize_1d.mlir

+func.func @large_step_size(%A: memref<4294967295xf32>) {
+ %cst = arith.constant 0.000000e+00 : f32
+ // CHECK: affine.for %{{.*}} = 0 to 256 step 549755813760 {
+ affine.for %i = 0 to 256 step 4294967295 {


The problem is that currently loops with non-unit step sizes are not vectorized correctly (we should probably add a check and avoid vectorizing these loops).
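The concern about non-unit steps can be made concrete with a small simulation. This is only a sketch under stated assumptions: it assumes each vectorized iteration writes `vf` contiguous elements starting at the induction variable and that the new step is `step * vf`; it ignores masking and bounds handling. The functions `scalarIndices`, `vectorIndices`, and `isStepSafeToVectorize` are hypothetical names for illustration, not existing MLIR APIs.

```cpp
#include <cstdint>
#include <set>

// Indices written by the scalar loop: for (i = 0; i < upper; i += step).
std::set<int64_t> scalarIndices(int64_t upper, int64_t step) {
  std::set<int64_t> out;
  for (int64_t i = 0; i < upper; i += step)
    out.insert(i);
  return out;
}

// Indices written by the vectorized loop, ASSUMING each iteration stores
// `vf` contiguous elements starting at i and the loop advances by step * vf.
std::set<int64_t> vectorIndices(int64_t upper, int64_t step, int64_t vf) {
  std::set<int64_t> out;
  for (int64_t i = 0; i < upper; i += step * vf)
    for (int64_t lane = 0; lane < vf; ++lane)
      out.insert(i + lane);
  return out;
}

// Hypothetical legality check of the kind suggested in the review:
// only unit-step loops are treated as safe to vectorize.
bool isStepSafeToVectorize(int64_t step) { return step == 1; }
```

Under these assumptions, a loop over `0 to 256` with step 1 and `vf = 128` touches the same indices in both forms, while the same loop with step 2 does not: the scalar loop writes the even indices 0 through 254, whereas the vectorized form writes 0 through 127.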

Lewuathe requested a review from a team as a code owner September 11, 2023 23:07

llvmbot added the mlir:core, mlir:affine, mlir, and mlir:afine labels Sep 11, 2023

[mlir][affine] Support vectorization with the step size exceeding 2^32-1

c3aa628

Lewuathe force-pushed the int64-step-size branch from 165104c to c3aa628 Compare September 12, 2023 00:23

dcaballe removed the mlir:afine label Sep 14, 2023

Lewuathe requested a review from sergei-grechanik September 22, 2023 04:19

sergei-grechanik reviewed Sep 22, 2023
