You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Qualcomm AI Engine Direct - Requantization Mechanism Implementation (#2823)
Summary:
- Implement requantization so mix quantization ( e.g., 8bit quant + 16 bit quant) can be properly delegated to QNN.
- Reusing test_qnn_backend_view_permute_matmul unit test to check mix quantization is working as expected.
- Added etdump logic back to qnn_executor_runner that was deleted unintentionally during this PR: a531ca5#diff-f3647de74042ac9a417e2d4000a6f2db00c22c89fd028e9433d3c79ffb7d56f6
- Refactor common arguments in VIT.
Pull Request resolved: #2823
Reviewed By: kirklandsign
Differential Revision: D55705289
Pulled By: cccclai
fbshipit-source-id: c01744ce05f0ab27cf25fc28856ce150462470d5
0 commit comments