Changed weight map to tensor and fix the refit bug #3573

cehongwang · 2025-06-13T21:59:57Z

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue)

Type of change

Please delete options that are not relevant and/or add your own.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR so that relevant reviewers are notified

cehongwang · 2025-06-13T22:00:28Z

py/torch_tensorrt/dynamo/conversion/converter_utils.py

            # Used for refit
-            ctx.weight_refit_map[name + " CONSTANT"] = numpy_value.reshape(-1)
+            ctx.weight_refit_map[name + " CONSTANT"] = torch_value


Add a comment explaining why the " CONSTANT" suffix is appended.

py/torch_tensorrt/dynamo/conversion/converter_utils.py

narendasan · 2025-06-16T21:02:43Z

py/torch_tensorrt/dynamo/conversion/converter_utils.py

@@ -321,7 +321,15 @@ def cast_int_or_float_to_bool(


 def to_trt_weights(


I think we can streamline some arguments. Why do we need target, name, layer name, and weight name? Can we derive some of these from the others?

py/torch_tensorrt/dynamo/conversion/converter_utils.py

py/torch_tensorrt/dynamo/conversion/impl/conv.py

peri044 · 2025-06-16T23:32:26Z

py/torch_tensorrt/dynamo/conversion/_ConversionContext.py

    cpu_weights_reference_holder: dict[str, Union[torch.Tensor]] = field(
        default_factory=dict
    )

+    def record_weight(self, name: str, weight: torch.Tensor) -> None:
+        self.weight_refit_map[name] = weight


Add a docstring explaining why we are doing this, especially the comment related to self.cpu_weights_reference_holder[name + " CPU_REFERENCE"] = weight

The name " CPU_REFERENCE" is a bit random. Any name could work because all we need is to hold it on CPU. Moreover, since we have refit map, this is actually a bit redundant.

Do we need a suffix here? It's just holding references; it should not be inspected later. I don't think it should even be a dictionary.
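A minimal sketch of the suggestion above, assuming the holder only needs to keep CPU weights alive and is never read back. `ContextSketch` is a hypothetical stand-in, not the project's `ConversionContext`, and a plain list replaces the `torch.Tensor` so the snippet runs without torch.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ContextSketch:
    """Hypothetical stand-in for ConversionContext (illustration only)."""

    # Looked up by weight name at refit time, so a dict keyed by name fits.
    weight_refit_map: dict[str, Any] = field(default_factory=dict)
    # Only keeps objects alive and is never inspected, so a list is enough
    # and no " CPU_REFERENCE" suffix is needed.
    cpu_weights_reference_holder: list[Any] = field(default_factory=list)

    def record_weight(self, name: str, weight: Any) -> None:
        """Record a weight for refit and hold a reference so it stays alive."""
        self.weight_refit_map[name] = weight
        self.cpu_weights_reference_holder.append(weight)


ctx = ContextSketch()
w = [1.0, 2.0, 3.0]  # stand-in for a CPU torch.Tensor
ctx.record_weight("conv1 KERNEL", w)
```

Whether the holder is needed at all once the refit map stores the tensors is the open question raised in this thread; the sketch only shows what dropping the suffix and the dictionary would look like.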

py/torch_tensorrt/dynamo/conversion/converter_utils.py

peri044 · 2025-06-16T23:34:54Z

py/torch_tensorrt/dynamo/conversion/converter_utils.py

+    supported_weight_types = ["KERNEL", "BIAS", "CONSTANT"]
+    assert (
+        layer_type_name in supported_layer_types
+    ), f"Unsupported layer type: {layer_type_name}. Please add the layer type to this function to enable refitting."


The message says "Please add the layer type to this function to enable refitting." What does this mean? How do we add this?

py/torch_tensorrt/dynamo/conversion/impl/conv.py

py/torch_tensorrt/dynamo/conversion/impl/deconv.py

narendasan · 2025-06-17T00:56:44Z

py/torch_tensorrt/dynamo/conversion/converter_utils.py

+    ctx: ConversionContext,
+    value: torch.Tensor,
+    name: str,
+    layer_type_name: str,


We should use Literal type annotations here.
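A rough sketch of what `typing.Literal` annotations could look like for these parameters. The weight type values come from the diff in this PR; the layer type values and the `refit_key` helper are assumptions made for illustration and are not taken from the code under review.

```python
from typing import Literal, get_args

# Weight roles as listed in the diff; layer names are assumed for illustration.
WeightType = Literal["KERNEL", "BIAS", "CONSTANT"]
LayerType = Literal["CONVOLUTION", "DECONVOLUTION", "CONSTANT"]


def refit_key(name: str, weight_type: WeightType) -> str:
    """Hypothetical helper: build a refit-map key such as 'conv1 KERNEL'."""
    # Literal is only enforced by static type checkers, so validate at
    # runtime too; get_args recovers the allowed values from the alias.
    if weight_type not in get_args(WeightType):
        raise ValueError(f"Unsupported weight type: {weight_type}")
    return f"{name} {weight_type}"
```

With annotations like these, a type checker can flag a misspelled weight type at the call site, and the hand-written list of supported values can be derived from the alias instead of being maintained separately.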

narendasan · 2025-06-17T00:58:25Z

py/torch_tensorrt/dynamo/conversion/converter_utils.py

+        weight_type_name in supported_weight_types
+    ), f"Encountered unsupported weight type: {weight_type_name}. Supported types are: {supported_weight_types}. Manually calling to_trt_weights with a custom weight type is not intended for general use."
+
+    if weight_type_name == "CONSTANT" and layer_type_name == "CONSTANT":


What is the difference between a weight type and a layer type?
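One possible reading of the distinction, sketched under assumptions: the layer type names the kind of TensorRT layer that owns the weights, while the weight type names the role a tensor plays inside that layer. The pairing table and `check_pair` below are hypothetical and only illustrate that reading.

```python
# Assumed pairing of layer kinds to the weight roles they own (illustrative).
ALLOWED_WEIGHTS = {
    "CONVOLUTION": ("KERNEL", "BIAS"),
    "DECONVOLUTION": ("KERNEL", "BIAS"),
    "CONSTANT": ("CONSTANT",),
}


def check_pair(layer_type: str, weight_type: str) -> None:
    """Raise ValueError if the layer kind does not own this weight role."""
    if layer_type not in ALLOWED_WEIGHTS:
        raise ValueError(f"Unsupported layer type: {layer_type}")
    if weight_type not in ALLOWED_WEIGHTS[layer_type]:
        raise ValueError(
            f"Layer type {layer_type} has no weight of type {weight_type}"
        )


check_pair("CONVOLUTION", "KERNEL")  # a convolution owns a kernel
check_pair("CONSTANT", "CONSTANT")   # a constant layer owns a single tensor
```

Under this reading, a single table keyed by layer type could replace the two separate lists and make invalid combinations impossible to pass silently.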

facebook-github-bot added the cla signed label Jun 13, 2025

github-actions bot added the component: conversion, component: api [Python], and component: dynamo labels Jun 13, 2025

github-actions bot requested a review from peri044 June 13, 2025 22:00

cehongwang commented Jun 13, 2025


Initial attempt

cf064c5

cehongwang force-pushed the refit-map-type-change branch from d7c6735 to cf064c5 June 13, 2025 22:34

Fixed the bug of refitting, but need a more systematic approach

e34fb80

github-actions bot added the component: converters label Jun 14, 2025