Skip to content

Conversation

junpi3
Copy link
Contributor

@junpi3 junpi3 commented Apr 5, 2024

## Summary
We introduce support for the convolution cases covered by [ATen-VK's default Depthwise implementation](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L68). This is achieved by
- reusing the [existing `conv2d_dw.glsl`](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/glsl/conv2d_dw.glsl), and
- [moving special weights prepacking from CPU](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L80-L132) to the GPU in `conv2d_dw_prepack_weights.glsl`.

The implementation is on-par with ATen-VK's Depthwise. This means it only covers:
- `in_channels == groups`, `out_channels == groups`

A full implementation would cover, for any positive integer K:
- `in_channels == groups`, `out_channels == groups * K`

Differential Revision: [D55813511](https://our.internmc.facebook.com/intern/diff/D55813511/)

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Apr 5, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2884

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 0f0a63f with merge base d3326a2 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported labels Apr 5, 2024
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D55813511

## Summary
We introduce support for the convolution cases covered by [ATen-VK's default Depthwise implementation](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L68). This is achieved by
- reusing the [existing `conv2d_dw.glsl`](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/glsl/conv2d_dw.glsl), and
- [moving special weights prepacking from CPU](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L80-L132) to the GPU in `conv2d_dw_prepack_weights.glsl`.

The implementation is on-par with ATen-VK's Depthwise. This means it only covers:
- `in_channels == groups`, `out_channels == groups`

A full implementation would cover, for any positive integer K:
- `in_channels == groups`, `out_channels == groups * K`

Differential Revision: [D55813511](https://our.internmc.facebook.com/intern/diff/D55813511/)

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D55813511

junpi3 pushed a commit that referenced this pull request Apr 8, 2024
Pull Request resolved: #2884

## Summary
We introduce support for the convolution cases covered by [ATen-VK's default Depthwise implementation](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L68). This is achieved by
- reusing the [existing `conv2d_dw.glsl`](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/glsl/conv2d_dw.glsl), and
- [moving special weights prepacking from CPU](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L80-L132) to the GPU in `conv2d_dw_prepack_weights.glsl`.

The implementation is on-par with ATen-VK's Depthwise. This means it only covers:
- `in_channels == groups`, `out_channels == groups`

A full implementation would cover, for any positive integer K:
- `in_channels == groups`, `out_channels == groups * K`
ghstack-source-id: 221721752
@exported-using-ghexport

Differential Revision: [D55813511](https://our.internmc.facebook.com/intern/diff/D55813511/)
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in c4ac14c.

@mergennachin mergennachin mentioned this pull request Apr 26, 2024
kedarnath03 pushed a commit to kedarnath03/executorch that referenced this pull request Jun 25, 2025
## Summary
We introduce support for the convolution cases covered by [ATen-VK's default Depthwise implementation](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L68). This is achieved by
- reusing the [existing `conv2d_dw.glsl`](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/glsl/conv2d_dw.glsl), and
- [moving special weights prepacking from CPU](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L80-L132) to the GPU in `conv2d_dw_prepack_weights.glsl`.

The implementation is on-par with ATen-VK's Depthwise. This means it only covers:
- `in_channels == groups`, `out_channels == groups`

A full implementation would cover, for any positive integer K:
- `in_channels == groups`, `out_channels == groups * K`

Differential Revision: [D55813511](https://our.internmc.facebook.com/intern/diff/D55813511/)

ghstack-source-id: 221526244
Pull Request resolved: pytorch/executorch#2884
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants