Closed
2 of 2 issues completedDescription
This issue tracks an internal discussion with QA. This simple snippet shows why using cuda.core
today on Windows might fail, depending on if it's TCC or WDDM mode:
>>> from cuda import cuda, cudart
>>> print(cudart.cudaGetDevice())
(<cudaError_t.cudaSuccess: 0>, 0)
>>> print(cuda.cuDeviceGetMemPool(0))
(<CUresult.CUDA_ERROR_NOT_SUPPORTED: 801>, <CUmemoryPool 0x0>)
cuda.core
currently assumes the stream-ordered memory allocator is available. However, CUDA on Windows is a bit more complicated than on Linux, since there are two operation modes:
- In the WDDM mode (which is the case during
cuda.core
development), things should work just fine. - In the TCC mode (as reported by QA), this is unsupported
We need some treatments to make it usable on TCC.