[flang][cuda] Add entry point to launch cuda fortran kernel #113490

clementval · 2024-10-23T20:01:03Z

No description provided.

llvmbot · 2024-10-23T20:01:40Z

@llvm/pr-subscribers-flang-runtime

Author: Valentin Clement (バレンタインクレメン) (clementval)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/113490.diff

3 Files Affected:

(added) flang/include/flang/Runtime/CUDA/kernel.h (+29)
(modified) flang/runtime/CUDA/CMakeLists.txt (+1)
(added) flang/runtime/CUDA/kernel.cpp (+37)

diff --git a/flang/include/flang/Runtime/CUDA/kernel.h b/flang/include/flang/Runtime/CUDA/kernel.h
new file mode 100644
index 00000000000000..f08cea8c1e4a7d
--- /dev/null
+++ b/flang/include/flang/Runtime/CUDA/kernel.h
@@ -0,0 +1,29 @@
+//===-- include/flang/Runtime/CUDA/kernel.h ---------------------*- C++ -*-===//
+//
+// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+
+#ifndef FORTRAN_RUNTIME_CUDA_KERNEL_H_
+#define FORTRAN_RUNTIME_CUDA_KERNEL_H_
+
+#include "flang/Runtime/entry-names.h"
+#include <cstddef>
+#include <stdint.h>
+
+namespace Fortran::runtime::cuda {
+
+extern "C" {
+
+// This function uses intptr_t instead of CUDA's unsigned int to match
+// the type of MLIR's index type. This avoids the need for casts in the
+// generated MLIR code.
+void RTDEF(CUFLaunchKernel)(const void *kernelName, intptr_t gridX,
+    intptr_t gridY, intptr_t gridZ, intptr_t blockX, intptr_t blockY,
+    intptr_t blockZ, int32_t smem, void **params, void **extra);
+
+} // extern "C"
+
+} // namespace Fortran::runtime::cuda
diff --git a/flang/runtime/CUDA/CMakeLists.txt b/flang/runtime/CUDA/CMakeLists.txt
index 86523b419f8711..ce87f3efdc3632 100644
--- a/flang/runtime/CUDA/CMakeLists.txt
+++ b/flang/runtime/CUDA/CMakeLists.txt
@@ -17,6 +17,7 @@ add_flang_library(${CUFRT_LIBNAME}
   allocator.cpp
   allocatable.cpp
   descriptor.cpp
+  kernel.cpp
   memory.cpp
   registration.cpp
 )
diff --git a/flang/runtime/CUDA/kernel.cpp b/flang/runtime/CUDA/kernel.cpp
new file mode 100644
index 00000000000000..844aefd59462dc
--- /dev/null
+++ b/flang/runtime/CUDA/kernel.cpp
@@ -0,0 +1,37 @@
+//===-- runtime/CUDA/kernel.cpp -------------------------------------------===//
+//
+// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+
+#include "flang/Runtime/CUDA/kernel.h"
+#include "../terminator.h"
+#include "flang/Runtime/CUDA/common.h"
+
+#include "cuda_runtime.h"
+
+namespace Fortran::runtime::cuda {
+
+extern "C" {
+
+void RTDEF(CUFLaunchKernel)(const void *kernel, intptr_t gridX, intptr_t gridY,
+    intptr_t gridZ, intptr_t blockX, intptr_t blockY, intptr_t blockZ,
+    int32_t smem, void **params, void **extra) {
+  dim3 gridDim;
+  gridDim.x = gridX;
+  gridDim.y = gridY;
+  gridDim.z = gridZ;
+  dim3 blockDim;
+  blockDim.x = blockX;
+  blockDim.y = blockY;
+  blockDim.z = blockZ;
+  cudaStream_t stream = 0;
+  CUDA_REPORT_IF_ERROR(
+      cudaLaunchKernel(kernel, gridDim, blockDim, params, smem, stream));
+}
+
+} // extern "C"
+
+} // namespace Fortran::runtime::cuda

flang/include/flang/Runtime/CUDA/kernel.h

Renaud-K

Thank you.

)

[flang][cuda] Add entry point to launch cuda fortran kernel

a7f34fb

clementval requested review from wangzpgi and Renaud-K October 23, 2024 20:01

llvmbot added flang:runtime flang Flang issues not falling into any other category labels Oct 23, 2024

Fix endif

6efce45

Renaud-K reviewed Oct 23, 2024

View reviewed changes

flang/include/flang/Runtime/CUDA/kernel.h Outdated Show resolved Hide resolved

Remove namespace

27cd98e

Renaud-K approved these changes Oct 23, 2024

View reviewed changes

wangzpgi approved these changes Oct 23, 2024

View reviewed changes

Fully qualified Terminator

f6b383e

clementval merged commit e2766b2 into main Oct 23, 2024
8 checks passed

clementval deleted the users/clementval/cuf_rt_kernel branch October 23, 2024 20:44

frobtech mentioned this pull request Oct 25, 2024

p/libc fexcept t #113664

Closed

NoumanAmir657 pushed a commit to NoumanAmir657/llvm-project that referenced this pull request Nov 4, 2024

[flang][cuda] Add entry point to launch cuda fortran kernel (llvm#113490

929f57d

)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[flang][cuda] Add entry point to launch cuda fortran kernel #113490

[flang][cuda] Add entry point to launch cuda fortran kernel #113490

clementval commented Oct 23, 2024

Uh oh!

llvmbot commented Oct 23, 2024

Uh oh!

Uh oh!

Renaud-K left a comment

Uh oh!

Uh oh!

Uh oh!

[flang][cuda] Add entry point to launch cuda fortran kernel #113490

[flang][cuda] Add entry point to launch cuda fortran kernel #113490

Conversation

clementval commented Oct 23, 2024

Uh oh!

llvmbot commented Oct 23, 2024

Uh oh!

Uh oh!

Renaud-K left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!