Scalar iteration with cat(SubArray{<:CuArray})

There appears to be a mismatch between behavior of normal matrices and CuMatrices in how views of rows and columns are treated that causes a scalar indexing issue. 

In CuMatrices I observe:

```
julia> x=CUDA.randn(4,4)
4×4 CuArray{Float32, 2, CUDA.Mem.DeviceBuffer}:

julia> @views x[1:3,1]
3-element CuArray{Float32, 1, CUDA.Mem.DeviceBuffer}:

julia> @views x[1,1:3]
3-element view(::CuArray{Float32, 2, CUDA.Mem.DeviceBuffer}, 1, 1:3) with eltype Float32:

julia> @views [1;x[1,1:3]]
┌ Warning: Performing scalar indexing on task Task (runnable) @0x0000022d0ded5750.
4-element Vector{Float32}:

julia> @views [1;x[1:3,1]]
4-element CuArray{Float32, 1, CUDA.Mem.DeviceBuffer}:
```

In normal matrices on the other hand:

```
julia> x=randn(4,4)
4×4 Matrix{Float64}:

julia> @views x[1:3,1]
3-element view(::Matrix{Float64}, 1:3, 1) with eltype Float64:

julia> @views x[1,1:3]
3-element view(::Matrix{Float64}, 1, 1:3) with eltype Float64:

julia> @views [1;x[1,1:3]]
4-element Vector{Float64}:

julia> @views [1;x[1:3,1]]
4-element Vector{Float64}:
```

Versioninfo:
```
julia> CUDA.versioninfo()
CUDA runtime 11.8, artifact installation
CUDA driver 12.0
NVIDIA driver 528.92.0

Libraries: 
- CUBLAS: 11.11.3
- CURAND: 10.3.0
- CUFFT: 10.9.0
- CUSOLVER: 11.4.1
- CUSPARSE: 11.7.5
- CUPTI: 18.0.0
- NVML: 12.0.0+528.92

Toolchain:
- Julia: 1.8.5
- LLVM: 13.0.1
- PTX ISA support: 3.2, 4.0, 4.1, 4.2, 4.3, 5.0, 6.0, 6.1, 6.3, 6.4, 6.5, 7.0, 7.1, 7.2
- Device capability support: sm_35, sm_37, sm_50, sm_52, sm_53, sm_60, sm_61, sm_62, sm_70, sm_72, sm_75, sm_80, sm_86

1 device:
  0: NVIDIA GeForce RTX 3050 Laptop GPU (sm_86, 3.886 GiB / 4.000 GiB available)

julia> versioninfo()
Julia Version 1.8.5
Commit 17cfb8e65e (2023-01-08 06:45 UTC)
Platform Info:
  OS: Windows (x86_64-w64-mingw32)
  CPU: 16 × AMD Ryzen 7 5800H with Radeon Graphics
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, znver3)
  Threads: 1 on 16 virtual cores
Environment:
  JULIA_EDITOR = code
  JULIA_NUM_THREADS = 
```

Does it make sense to expand Base.vcat and Base.hcat lines 141-144 in https://github.com/JuliaGPU/GPUArrays.jl/blob/master/src/host/base.jl from AbstractGPUArray to AnyGPUArray @maleadt ?

Seperately, should @views x[1:3,1] return a view instead of CuArray for conformity with Base matrices?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Scalar iteration with cat(SubArray{<:CuArray}) #2078

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Scalar iteration with cat(SubArray{<:CuArray}) #2078

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions