We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 54b878b commit 2ba5115Copy full SHA for 2ba5115
aten/src/ATen/cuda/CUDABlas.cpp
@@ -241,7 +241,6 @@ namespace at::cuda::blas {
241
CUDABLAS_NONNEGINT_CHECK(bgemm<Dtype>, num_batches); \
242
} while (0)
243
244
-
245
namespace {
246
// Following the pattern of CuSparseDescriptor
247
// Defined here for now because this is the only place cublas_lt interface is
aten/src/ATen/cuda/CUDAContextLight.h
@@ -9,6 +9,7 @@
9
10
// cublasLT was introduced in CUDA 10.1 but we enable only for 11.1 that also
11
// added bf16 support
12
+
13
#include <cublasLt.h>
14
15
#ifdef CUDART_VERSION
0 commit comments