fix: Add support for truncate_long_and_double in FX #1865
Conversation
ref_output,
rtol=1e-04,
atol=1e-04,
check_dtype=False,
The output data type will be different, since TRT cannot output int64 types.
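For context, a minimal sketch of the comparison this enables, assuming the eager reference returns `int64` while the TRT engine returns `int32` (the tensor values here are illustrative, not taken from the test suite):

```python
import torch

# Illustrative only: the eager reference keeps its original int64 dtype, while
# the TRT path returns int32, so only the values are compared (dtypes ignored).
ref_output = torch.arange(4, dtype=torch.int64)
trt_output = ref_output.to(torch.int32)  # stand-in for the engine's int32 result

# Equivalent in spirit to passing check_dtype=False above
assert torch.equal(trt_output.to(ref_output.dtype), ref_output)
```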
Title changed: torch.int64 inputs in FX → truncate_long_and_double in FX
Force-pushed from 65bc360 to ad8aecf
elif dtype == torch.int64:
    if truncate_long_and_double:
        _LOGGER.warn(
            "Detected Int64 Input, Casting to Int32 for TRT Engine Compatibility"
        )
        return trt.int32
    else:
        raise TypeError(
            "Detected Int64 Input which is not supported by tensorrt, enable compilation "
            + "option truncate_long_and_double=True to cast input to Int32 for TRT Engine"
        )
Similar to the TorchScript path, allow the truncate_long_and_double argument to automatically cast inputs as needed by TRT engines, while informing the user. This is primarily helpful for intermediate inputs (not user-provided) that happen to be long-type tensors, such as indices for embeddings.
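For reference, a hedged usage sketch of what this enables. The truncate_long_and_double flag already exists on the TorchScript path; whether the FX frontend honors it is exactly what this PR proposes, so treat the call below as illustrative rather than a confirmed API:

```python
import torch
import torch_tensorrt

# Toy module whose forward input is an int64 index tensor (purely illustrative)
class EmbeddingModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = torch.nn.Embedding(100, 16)

    def forward(self, idx):
        return self.emb(idx)

model = EmbeddingModel().eval().cuda()
inputs = [torch.randint(0, 100, (1, 128), dtype=torch.int64, device="cuda")]

# With truncate_long_and_double=True, int64 (and float64) inputs would be
# downcast to int32/float32 instead of raising a TypeError as in the diff above.
trt_model = torch_tensorrt.compile(model, inputs=inputs, truncate_long_and_double=True)
```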
@gs-olive is this PR still needed?
Yes, this PR is still needed to support T5 in the
Can we create a separate PR for Dynamo so we can land the feature there at least?
- Add utility capabilities for accepting `int64` inputs to TRTModules to support multiple use cases (a minimal casting sketch follows this list)
- Supported cases include situations where internal tensors in split modules are `int64` (generally used for indexing torch Tensors)
- This also supports cases where the user wants to pass `long` tensors as `forward` inputs
- Add test cases to verify functionality and accuracy
- Enable tests for `TRTModuleNext`, which are now fully supported on `main`
- Add support and testing for `double` type inputs
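As a rough illustration of the casting behavior described in the commit message above (this is not the actual TRTModule code, and the helper name is invented):

```python
import torch

def _cast_trt_unsupported(t: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper: TRT engine bindings accept neither int64 nor float64,
    # so long/double tensors are downcast before being bound to the engine.
    if t.dtype == torch.int64:
        return t.to(torch.int32)
    if t.dtype == torch.float64:
        return t.to(torch.float32)
    return t

# e.g. applied to every forward input before execution
inputs = [torch.ones(2, 2, dtype=torch.int64), torch.ones(2, 2, dtype=torch.float64)]
casted = [_cast_trt_unsupported(t) for t in inputs]
```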
@gs-olive can you create separate PRs for each backend? They will be easier to merge that way.
Title changed: truncate_long_and_double in FX → truncate_long_and_double in Dynamo
Title changed: truncate_long_and_double in Dynamo → truncate_long in Dynamo
Title changed: truncate_long in Dynamo → truncate_long_and_double in FX
Closed in favor of the more robust #2021 (no need to manually downcast; the FX graph/Dynamo utilities handle this for us automatically).
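To make that distinction concrete, here is a very rough sketch of graph-level downcasting, i.e. rewriting long placeholders in an FX graph so converters never see int64 tensors. This is not the code from #2021, just an illustration of the general approach, and the function name is invented:

```python
import torch
import torch.fx as fx

def downcast_long_placeholders(gm: fx.GraphModule, example_inputs):
    # Pair each placeholder with its example input to learn its dtype
    placeholders = [n for n in gm.graph.nodes if n.op == "placeholder"]
    for node, example in zip(placeholders, example_inputs):
        if isinstance(example, torch.Tensor) and example.dtype == torch.int64:
            with gm.graph.inserting_after(node):
                cast = gm.graph.call_method("to", args=(node, torch.int32))
            # Route all existing users through the cast, then restore the cast's
            # own input back to the original placeholder
            node.replace_all_uses_with(cast)
            cast.update_arg(0, node)
    gm.recompile()
    return gm
```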
Description
Fixes #1864
Addresses #1740