Vulkan: add conv_transpose_2d operation #16022

relent95 · 2025-09-16T07:29:10Z

This PR adds the conv_transpose_2d operation to the Vulkan backend. The code is based on the implementation of the existing conv_2d operation. The shader supports strides (s0, s1), paddings (p0, p1), and dilations (d0, d1), but ggml_vk_conv_transpose_2d() constrains them to s1 = s0, p0 = p1 = 0, d0 = d1 = 1 because of the existing GGML_OP_CONV_TRANSPOSE_2D interface.
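To make the effect of those constraints concrete, here is a minimal sketch of the standard transposed-convolution output-size formula along one axis. The function names are hypothetical and are not taken from ggml; the formula is the usual textbook one, assumed here rather than quoted from the PR.

```cpp
#include <cassert>
#include <cstdint>

// Output extent of a transposed convolution along one axis (textbook formula).
// in: input extent, k: kernel extent, s: stride, p: padding, d: dilation.
// Hypothetical helper, not part of ggml.
constexpr int64_t conv_transpose_out_size(int64_t in, int64_t k, int64_t s, int64_t p, int64_t d) {
    return (in - 1) * s - 2 * p + d * (k - 1) + 1;
}

// With the constraints from the description (p = 0, d = 1) the formula
// collapses to (in - 1) * s + k.
constexpr int64_t conv_transpose_out_size_constrained(int64_t in, int64_t k, int64_t s) {
    return conv_transpose_out_size(in, k, s, /*p=*/0, /*d=*/1);
}

// Compile-time usage check: a 4-wide input with a 3-wide kernel and stride 2.
static_assert(conv_transpose_out_size_constrained(4, 3, 2) == (4 - 1) * 2 + 3,
              "constrained form must match (in - 1) * s + k");
```

The general helper keeps p and d as parameters because the shader is described as supporting them, even though the current interface fixes them.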

ggml/src/ggml-vulkan/ggml-vulkan.cpp

0cc4m · 2025-09-16T09:44:58Z

If this is based on the existing conv_2d shader, shouldn't it be possible to use the existing .comp file and only change the parts that are actually different with a preprocessor variable?
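As a rough illustration of this suggestion, the C++ sketch below keeps one function body and switches only the index mapping with a preprocessor macro. The macro name `TRANSPOSE`, the struct, and the function are hypothetical and are not copied from conv2d_mm.comp; the sketch assumes the usual relation between forward and transposed convolution indices.

```cpp
#include <cassert>
#include <cstdint>

// Per-axis parameters: stride, padding, dilation. Hypothetical type.
struct AxisParams { uint32_t s; uint32_t p; uint32_t d; };

// Returns the input coordinate read by output coordinate `o` through kernel
// tap `k`, or -1 if no valid input element contributes.
// Compile with -DTRANSPOSE (a hypothetical switch) for the transposed variant.
inline int64_t input_index(uint32_t o, uint32_t k, uint32_t in_size, const AxisParams & c) {
#ifdef TRANSPOSE
    // Transposed: o = i*s + k*d - p, so i = (o + p - k*d) / s,
    // and only exact multiples of the stride map to an input element.
    const int64_t num = int64_t(o) + int64_t(c.p) - int64_t(k) * int64_t(c.d);
    if (num < 0 || num % int64_t(c.s) != 0) { return -1; }
    const int64_t i = num / int64_t(c.s);
#else
    // Forward: i = o*s + k*d - p.
    const int64_t i = int64_t(o) * int64_t(c.s) + int64_t(k) * int64_t(c.d) - int64_t(c.p);
    if (i < 0) { return -1; }
#endif
    // The upper-bound check is shared by both variants.
    return i < int64_t(in_size) ? i : -1;
}
```

Everything outside the `#ifdef` (loads, accumulation, stores in a real kernel) would stay common to both operations, which is the maintenance benefit being discussed.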

etasnadi · 2025-09-18T08:30:46Z

> If this is based on the existing conv_2d shader, shouldn't it be possible to use the existing .comp file and only change the parts that are actually different with a preprocessor variable?

You are right: if each conv variant (transpose, gradient, channel-wise) gets its own shader with repeated code, maintenance will be really tough.

Also, there are still many ways left to optimize the conv kernel and I also have a few updates in my private repo that I might polish in the near future and submit.

0cc4m · 2025-09-18T08:35:26Z

@etasnadi Yeah, the PR has already been updated to do that. You could also do a review if you want, you know the shader better than me. I'll mostly make sure that the C++ code is fine and the shader passes on Intel, AMD and Nvidia.

etasnadi · 2025-09-18T08:54:54Z

> @etasnadi Yeah, the PR has already been updated to do that. You could also do a review if you want, you know the shader better than me. I'll mostly make sure that the C++ code is fine and the shader passes on Intel, AMD and Nvidia.

Yes, I've just realized that it was updated since.

Now my problem is that the kernel is and will be too complicated (it was really complicated before too), so we might need to introduce some abstraction I guess. Maybe @jeffbolznv has some ideas.

What do you think about adding support for HLSL shaders? As far as I know, the glslangValidator already has basic support, but in the meantime we could use dxc.

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp

0cc4m · 2025-09-18T09:08:02Z

> What do you think about adding support for HLSL shaders? As far as I know, the glslangValidator already has basic support, but in the meantime we could use dxc.

What's the advantage of HLSL over GLSL? I'm not familiar with it, and not a fan of Microsoft dependencies. It would probably make maintenance harder. If you want to look into a shader language with more modern features, wouldn't slang be more interesting and more open?

Personally I'm hoping one of the projects looking into a C++-based compute shader syntax (similar to CUDA and ROCm) pans out. For now GLSL is good enough for me.

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp

etasnadi · 2025-09-18T09:23:42Z

> > What do you think about adding support for HLSL shaders? As far as I know, the glslangValidator already has basic support, but in the meantime we could use dxc.
>
> What's the advantage of HLSL over GLSL? I'm not familiar with it, and not a fan of Microsoft dependencies. It would probably make maintenance harder. If you want to look into a shader language with more modern features, wouldn't slang be more interesting and more open?
>
> Personally I'm hoping one of the projects looking into a C++-based compute shader syntax (similar to CUDA and ROCm) pans out. For now GLSL is good enough for me.

For example, it seems that it supports templates: https://devblogs.microsoft.com/directx/announcing-hlsl-2021/#template-functions-and-data-types - and templates alone would help a lot. I am not a fan of adding dependencies on projects governed by a single company either, but this kernel will be unmaintainable in the future at this abstraction level, and GLSL has limited features to deal with the problem.

Slang is also a good idea; however, I do not know how widely it is adopted or whether it is mature enough. For example, I tried to use coopmats with slang without success ~1 year ago - I guess their compiler does not support all extensions automatically?

etasnadi · 2025-09-18T10:53:12Z

> > What do you think about adding support for HLSL shaders? As far as I know, the glslangValidator already has basic support, but in the meantime we could use dxc.
>
> What's the advantage of HLSL over GLSL? I'm not familiar with it, and not a fan of Microsoft dependencies. It would probably make maintenance harder. If you want to look into a shader language with more modern features, wouldn't slang be more interesting and more open?
>
> Personally I'm hoping one of the projects looking into a C++-based compute shader syntax (similar to CUDA and ROCm) pans out. For now GLSL is good enough for me.

Now I see that they don't have coopmat support, but it's WIP: shader-slang/slang#7634. So I believe it is a good idea to add support for slang in the near future! Also, there are several NV-suffixed accounts contributing to the project, so I guess Nvidia is betting on it.

0cc4m · 2025-09-18T11:15:21Z

@etasnadi I think it's Khronos, not Nvidia specifically, but yeah. Coopmat should already be there, see shader-slang/slang#7170 (comment), but let's not sidetrack this PR. If you want to look into it, go ahead. If discussion is needed, please open an issue about it.

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp

jeffbolznv · 2025-09-18T14:55:53Z

HLSL doesn't support spec constants, which IMO is a deal breaker. It also only has coopmat1 level of support for use in Vulkan. slang supports coopmat2, spec constants, and generics, and there are cases where generics would be helpful.

0cc4m

The code runs correctly on my hardware. Looks good, we just need to resolve the last few comments.

ggml/src/ggml-vulkan/ggml-vulkan.cpp

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp

0cc4m

LGTM

@danbev

* origin/master: (39 commits)
  ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)
  ci : enable Vulkan workflow on Mac (ggml-org#16194)
  ggml-cpu: Respect cpumask settings (ggml-org#16164)
  ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (ggml-org#15928)
  zdnn: refactor codebase + add docs (ggml-org#16178)
  codeowners : add @danbev to model-conversion example [no ci] (ggml-org#16190)
  devops: add s390x containers (ggml-org#15915)
  ggml-cpu : fix typo in gemm comments [no ci] (ggml-org#16189)
  feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (ggml-org#16177)
  clang-tidy : disable warning about performance enum size (ggml-org#16127)
  ggml : implement set_rows with i32 index (ggml-org#16159)
  codeowners : update + cleanup (ggml-org#16174)
  common : enable `--offline` mode without curl support (ggml-org#16137)
  webui : fix handling incomplete chunks (ggml-org#16107)
  embedding : fix typos in README (ggml-org#16171)
  common : remove unused local variables (ggml-org#16140)
  ggml : extend ggml_can_fuse to work with non-sequential nodes (ggml-org#16123)
  ggml : add ggml_op_is_empty (ggml-org#16122)
  codeowners : update ownership for @ngxson and @allozuar (ggml-org#16128)
  Vulkan: add conv_transpose_2d operation (ggml-org#16022)
  ...

* Vulkan: add conv_transpose_2d operation
* Vulkan: fix typo in conv_transpose_2d shader(s0mp, s0L, s1mp, s1L)
* Vulkan: fix incorrect indentation in conv_transpose_2d shader
* Vulkan: add checking the push constants size limit and reuse conv2d_mm.comp for conv_transpose_2d operation
* Vulkan: revert the order of the index calculation and bound check in conv_2d shader
* Vulkan: explicity check push constants limit in supports_op() for conv_transpose_2d operation.
* Vulkan: remove unnecessary lower bound checks for H/W_idx in the conv_2d shader.
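The push constants limit check mentioned in these commits can be pictured with the sketch below. The struct layout and function name are hypothetical and are not the actual definitions in ggml-vulkan.cpp; the only outside fact assumed is that Vulkan guarantees at least 128 bytes for `maxPushConstantsSize`, so a larger parameter block has to be checked against the device's reported limit.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical parameter block, NOT the real struct from ggml-vulkan.cpp.
struct conv_transpose_2d_params_sketch {
    uint32_t Cout, Cin, N;
    uint32_t KW, KH, W, H, OW, OH;
    uint32_t s0, s1, p0, p1, d0, d1;
    uint32_t nb[9];        // tensor strides
    uint32_t fastdiv[12];  // precomputed division constants
};

// 36 32-bit fields: larger than the 128 bytes every Vulkan device must offer.
static_assert(sizeof(conv_transpose_2d_params_sketch) == 36 * sizeof(uint32_t),
              "sketch struct is expected to be tightly packed");

// Minimum value of maxPushConstantsSize that the Vulkan spec guarantees.
constexpr uint32_t kVulkanMinPushConstantsSize = 128;

// A supports_op()-style check: the operation is only offered when the
// device limit (as read from the physical device limits) is large enough.
constexpr bool supports_conv_transpose_2d(uint32_t max_push_constants_size) {
    return sizeof(conv_transpose_2d_params_sketch) <= max_push_constants_size;
}
```

With a check like this, a device that reports only the guaranteed minimum would fall back to another backend for the operation instead of failing at pipeline creation.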

relent95 added 2 commits September 16, 2025 15:58

Vulkan: add conv_transpose_2d operation

9d1b723

Merge branch 'ggml-org:master' into add-ggml-vulkan-conv-transpose-2d

4272243

relent95 requested a review from 0cc4m as a code owner September 16, 2025 07:29

relent95 added 3 commits September 16, 2025 16:39

Vulkan: fix typo in conv_transpose_2d shader(s0mp, s0L, s1mp, s1L)

5c888ca

Merge branch 'add-ggml-vulkan-conv-transpose-2d' of github.com:relent95/llama.cpp into add-ggml-vulkan-conv-transpose-2d

b02a5dd

Vulkan: fix incorrect indentation in conv_transpose_2d shader

e029cda

github-actions bot added the testing, Vulkan, and ggml labels Sep 16, 2025

jeffbolznv reviewed Sep 16, 2025

ggml/src/ggml-vulkan/ggml-vulkan.cpp (resolved)

Vulkan: add checking the push constants size limit and reuse conv2d_mm.comp for conv_transpose_2d operation

f5ae689

relent95 requested a review from jeffbolznv September 17, 2025 04:33

etasnadi reviewed Sep 18, 2025

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp (resolved)

etasnadi reviewed Sep 18, 2025

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp (resolved)

jeffbolznv reviewed Sep 18, 2025

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp (outdated, resolved)

Vulkan: revert the order of the index calculation and bound check in conv_2d shader

12aaeae

relent95 requested review from etasnadi and jeffbolznv September 19, 2025 02:08

Vulkan: explicity check push constants limit in supports_op() for conv_transpose_2d operation.

7e24d17

0cc4m reviewed Sep 21, 2025

ggml/src/ggml-vulkan/ggml-vulkan.cpp (resolved)

ggml/src/ggml-vulkan/ggml-vulkan.cpp (resolved)

ggml/src/ggml-vulkan/vulkan-shaders/conv2d_mm.comp (outdated, resolved)

Vulkan: remove unnecessary lower bound checks for H/W_idx in the conv_2d shader.

55b3fb5

relent95 requested a review from 0cc4m September 21, 2025 12:56

0cc4m approved these changes Sep 22, 2025

0cc4m merged commit 96fdca0 into ggml-org:master Sep 22, 2025
49 of 53 checks passed
