[mlir][nvvm] Add prefetch.tensormap #67564


Merged: 5 commits, Oct 17, 2023
Conversation

@grypp (Member) commented Sep 27, 2023

This PR adds the prefetch.tensormap Op. It brings the cache line containing the given TMA descriptor into the cache, for subsequent use by the cp.async.bulk.tensor instruction.

https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-prefetch-prefetchu
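For illustration, a minimal usage sketch in the NVVM dialect (the function and SSA value names are only illustrative); both the plain and the predicated forms follow the assembly format defined in the diff below:

  llvm.func @prefetch_example(%desc : !llvm.ptr, %pred : i1) {
    // Prefetch the cache line holding the TMA descriptor pointed to by %desc.
    nvvm.prefetch.tensormap %desc : !llvm.ptr
    // Predicated form: the instruction only executes when %pred is true.
    nvvm.prefetch.tensormap %desc, predicate = %pred : !llvm.ptr, i1
    llvm.return
  }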

@llvmbot (Member) commented Sep 27, 2023

@llvm/pr-subscribers-mlir-nvgpu
@llvm/pr-subscribers-mlir-gpu
@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-llvm

Changes

Full diff: https://github.com/llvm/llvm-project/pull/67564.diff

2 Files Affected:

  • (modified) mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td (+15)
  • (modified) mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir (+11)
diff --git a/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td b/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
index 0d4d734edd2b69b..e9c52c06ed27ebd 100644
--- a/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
+++ b/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
@@ -1512,6 +1512,21 @@ def NVVM_CpAsyncBulkTensorSharedCTAToGlobalOp : NVVM_Op<"cp.async.bulk.tensor.gl
   let hasVerifier = 1;
 }
 
+def NVVM_PrefetchTensorMapOp : NVVM_Op<"prefetch.tensormap",
+                    [DeclareOpInterfaceMethods<BasicPtxBuilderOpInterface>]>,
+  Arguments<(ins LLVM_i64ptr_any:$tmaDescriptor, PtxPredicate:$predicate)> {
+  let description = [{
+    The Op brings the cache line containing the given $tmaDescriptor for 
+    subsequent use by the `cp.async.bulk.tensor` instruction.
+  }];
+  let assemblyFormat = "$tmaDescriptor (`,` `predicate` `=` $predicate^)? attr-dict `:` type(operands)";
+  let extraClassDefinition = [{
+    std::string $cppClass::getPtx() { 
+      return std::string("prefetch.tensormap [%0];");
+    }
+  }];
+}
+
 //===----------------------------------------------------------------------===//
 // NVVM Wgmma Ops
 //===----------------------------------------------------------------------===//
diff --git a/mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir b/mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
index 7ffe1ad2bb2b111..8ff8868e96ace11 100644
--- a/mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
+++ b/mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
@@ -363,3 +363,14 @@ func.func @wgmma_f32_e5m2_e4m3(%descA : i64, %descB : i64) -> !mat32f32 {
       : !mat32f32 -> !mat32f32
   return %result2 : !mat32f32
 }
+
+// -----
+
+// CHECK-LABEL: @init_mbarrier_arrive_expect_tx
+llvm.func @init_mbarrier_arrive_expect_tx(%desc : !llvm.ptr, %pred : i1) {
+  //CHECK: llvm.inline_asm has_side_effects asm_dialect = att "prefetch.tensormap [$0];", "l"
+  nvvm.prefetch.tensormap %desc : !llvm.ptr
+  //CHECK: llvm.inline_asm has_side_effects asm_dialect = att "@$1 prefetch.tensormap [$0];", "l,b"
+  nvvm.prefetch.tensormap %desc, predicate = %pred : !llvm.ptr, i1
+  llvm.return
+}
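For context, the NVVM-to-LLVM conversion exercised by this test expands the op through its getPtx() string into an llvm.inline_asm op, with %0 in the PTX template becoming $0 bound to the first operand. A rough sketch of the IR produced for the two ops above (exact printing may differ across MLIR versions):

  // Unpredicated form: "l" constrains the descriptor pointer to a 64-bit register.
  llvm.inline_asm has_side_effects asm_dialect = att
    "prefetch.tensormap [$0];", "l" %desc : (!llvm.ptr) -> ()
  // Predicated form: the i1 predicate uses the "b" constraint and guards the
  // instruction via @$1.
  llvm.inline_asm has_side_effects asm_dialect = att
    "@$1 prefetch.tensormap [$0];", "l,b" %desc, %pred : (!llvm.ptr, i1) -> ()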

@grypp (Member, Author) commented Oct 16, 2023

This PR depends on #67102
