[ET] enabling half dtype input for quantization #11479

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

facebook-github-bot merged 9 commits into gh/ahmtox/12/base from gh/ahmtox/12/head

Jun 14, 2025

Contributor

ahmtox commented Jun 9, 2025 •

edited

Loading

Stack from ghstack (oldest at bottom):

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: D76053764


          [ET] enabling half dtype input for quantization

7b7290d

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

ahmtox requested review from manuelcandales and swolchok as code owners

June 9, 2025 15:04

pytorch-bot bot commented Jun 9, 2025 •

edited

Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11479

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Cancelled Job

As of commit 316d2c0 with merge base 8cfa858 ():

NEW FAILURES - The following jobs have failed:

Build Presets / linux (pybind, linux.2xlarge, executorch-ubuntu-22.04-clang12) / build (gh)
The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.
Build Presets / linux (pybind, linux.arm64.2xlarge, executorch-ubuntu-22.04-gcc11-aarch64) / build (gh)
docker: Error response from daemon: Get "https://registry-1.docker.io/v2/": read tcp [2600:1f18:5438:e704:39a2:9419:50e5:747]:37566->[2600:1f18:2148:bc01:f43d:e203:cafd:8307]:443: read: connection reset by peer.
Lint / link-check / lint-urls / linux-job (gh)
Lint / lintrunner / linux-job (gh)
The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.

CANCELLED JOB - The following job was cancelled. Please retry:

Build Presets / linux (llm, linux.2xlarge, executorch-ubuntu-22.04-clang12) / build (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label

This was referenced Jun 9, 2025

[ET-VK] double, short, and uint16 dtype runtime support #11365

Merged

[ET-VK][Ops] quantize ops skeleton test framework #11366

Merged

[ET-VK][Ops] quantize_per_token.default test setup #11367

Merged

[ET-VK][Ops] quantize_per_tensor.default test setup #11368

Merged

[ET-VK][Ops] quantization op shaders and impl #11369

Merged

[ET-VK][Ops] dequantize ops skeleton test framework #11480

Merged

[ET-VK][Ops] dequantize_per_tensor.default test setup #11481

Merged

[ET-VK][Ops] dequantize_per_token.default test setup #11482

Merged

[ET-VK][Ops] dequantization op shaders and impl #11483

Merged

Contributor

facebook-github-bot commented Jun 9, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764

facebook-github-bot added the fb-exported label


          Update on "[ET] enabling half dtype input for quantization"

58b1237

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 9, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764

manuelcandales approved these changes

View reviewed changes

Contributor

manuelcandales left a comment

LGTM


          Update on "[ET] enabling half dtype input for quantization"

96f1dcd

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

This was referenced Jun 11, 2025

[ET] enabling half dtype output for dequantization and making logic consistent #11552

Merged

[ET-VK][Ops] enabling double support for quantization and dequantization ops #11553

Merged

[ET-VK][Ops] choose_qparams ops skeleton test framework #11554

Merged

[ET-VK][Ops] choose_qparams.tensor test setup #11555

Merged

[ET-VK][Ops] choose_qparams_per_token_asymmetric.default test setup #11556

Merged

[ET-VK][Ops] choose_qparams op shaders and impl #11557

Merged

Contributor

facebook-github-bot commented Jun 11, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764

ahmtox added the release notes: vulkan label


          Update on "[ET] enabling half dtype input for quantization"

19a2fc3

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

ahmtox mentioned this pull request

[ET-VK][Ops] common test utils for converting aten types to vulkan types #11575

Merged

Contributor

facebook-github-bot commented Jun 11, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764


          Update on "[ET] enabling half dtype input for quantization"

849944d

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 12, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764


          Update on "[ET] enabling half dtype input for quantization"

0d595f7

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 12, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764


          Update on "[ET] enabling half dtype input for quantization"

9e31ad6

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 13, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764


          Update on "[ET] enabling half dtype input for quantization"

a49b0cf

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 13, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764


          Update on "[ET] enabling half dtype input for quantization"

316d2c0

Improving the cpu implementation op_quantize to support input half dtype and adding additional testing

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jun 13, 2025

This pull request was exported from Phabricator. Differential Revision: D76053764

facebook-github-bot merged commit 1f024fc into gh/ahmtox/12/base

92 of 98 checks passed

facebook-github-bot deleted the gh/ahmtox/12/head branch

June 14, 2025 03:45

facebook-github-bot temporarily deployed to cherry-pick-bot

June 14, 2025 03:45

— with

GitHub Actions Inactive

pytorchbot mentioned this pull request

[ET] enabling half dtype input for quantization #11668

Closed

ahmtox added a commit that referenced this pull request


          Revert vulkan changes from D76646172 fixup patch

179f62b

Summary:
# Context

Need these changes that were reverted in the weekend. Original stack of commits were unable to be merged into main due to an existing lintrunner issue blocking the merge. All the changes already went through [review](#11479) and approved.

Differential Revision: D76737404

ahmtox mentioned this pull request

Revert vulkan changes from D76646172 fixup patch #11727

Merged

SS-JIA added a commit that referenced this pull request


          [ET] enabling half dtype input for quantization

065a647

Pull Request resolved: #11479

Currently the cpu implementation for the quantization operator (which includes `quantize_per_token`, `quantize_per_tensor`, and `quantize_per_channel`), does not inherently support half (fp16) input scalar types. In order to align with the PyTorch implementation that accepts fp16 and bfp16 inputs, this diff aims to enable half input dtype support for the quantization operators. We will be comparing this implementation against the vulkan operators.

As defined in ExecuTorch [scalar_type_util.h](https://github.com/pytorch/executorch/blob/053686242c1687f0d51b3bb8befd14b047d7b025/runtime/core/exec_aten/util/scalar_type_util.h#L190) file, there is a method to enable support simply changing which preprocessor is called to ET_FORALL_FLOATH_TYPES. This enables support for Half (fp16), Float (fp32), and Double (fp64).

I have also included more comprehensive testing against the input dtypes, including adding double testing since it didn't already exist before. Instead of just confirming that all the output dtypes are supported, we also have a check that all input dtypes are supported now as well.
ghstack-source-id: 290376481
@exported-using-ghexport

Differential Revision: [D76053764](https://our.internmc.facebook.com/intern/diff/D76053764/)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed fb-exported release notes: vulkan