[WIP] [Do not merge] Reduce overhead of AOT Module #612
Conversation
functorch/_src/aot_autograd.py
Outdated
#########################################################
"""
(1) Create a new fn_for_tracing that lifts params as inputs (TODO: buffers)
(2) A new tracer - MyTracer (slight modification of PythonKeyTracer). This
    works with Proxy tensors instead of PythonTensors. Goal is to get a torch
    graph, and not torch.aten graph yet.
(3) This traced function is then passed on to aot_function.
(4) The params are read and flattened on every forward call, as they can change during training.
"""
#########################################################
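To make the quoted approach concrete, here is a minimal sketch of steps (1) and (4), assuming a simple Linear module; the function name and the use of torch.func.functional_call are illustrative stand-ins, not the PR's actual code:

import torch
import torch.utils._pytree as pytree

mod = torch.nn.Linear(3, 3)

# Collect and flatten the parameters; tree_flatten returns a flat list of
# Tensors plus a spec for rebuilding the name -> param mapping.
params = dict(mod.named_parameters())
params_flat, params_spec = pytree.tree_flatten(params)

def fn_with_params_as_args(*params_and_inputs):
    # Split the flat argument list back into parameters and real inputs,
    # then run the module functionally against the lifted parameters.
    n = len(params_flat)
    new_params = pytree.tree_unflatten(list(params_and_inputs[:n]), params_spec)
    return torch.func.functional_call(mod, new_params, params_and_inputs[n:])

# Per forward call (step 4): the current params are flattened and passed in
# ahead of the real inputs.
out = fn_with_params_as_args(*params_flat, torch.randn(2, 3))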
@@ -12,6 +12,8 @@
from .named_members_polyfill import _named_parameters, _named_buffers
from typing import Callable, List, Dict, Any, Tuple, Optional

import torchdynamo
perhaps:

try:
    from torchdynamo import disable as disable_dynamo
except ImportError:
    def disable_dynamo(x):
        return x

So we can still use this without dynamo.
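A quick usage note on the suggestion above: with the no-op fallback in place, the decorator can be applied unconditionally (the wrapped function here is a hypothetical stand-in for the per-call wrapper, not code from this PR):

@disable_dynamo
def _forward_with_flat_params(*args):
    # Whatever the AOT module does per call; dynamo skips tracing this
    # when torchdynamo is installed, and the fallback keeps it working
    # when it is not.
    ...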
functorch/_src/aot_autograd.py
Outdated
return fn_with_params_as_args

gm = torch.fx.symbolic_trace(mod)
Why do we need this? Seems not ideal, as this could be lossy.
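For context on the "lossy" concern: torch.fx.symbolic_trace runs the module with Proxy values, so data-dependent Python control flow in the model cannot be captured. A small illustrative example (not from this PR):

import torch
import torch.fx

class Gate(torch.nn.Module):
    def forward(self, x):
        # The branch depends on the value of x, which a Proxy cannot supply.
        if x.sum() > 0:
            return x + 1
        return x - 1

try:
    torch.fx.symbolic_trace(Gate())
except torch.fx.proxy.TraceError as e:
    print("symbolic_trace failed:", e)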
functorch/_src/aot_autograd.py
Outdated
else:
    params_flat, _ = pytree.tree_flatten(params)

params_flat = tuple(params_flat)
In the torchdynamo case (not in the general case), this list is a constant you don't need to recompute every time.
For training, the params will change after each update. Don't we have to flatten for each forward call?
The data will change, but the Tensor objects should be the same. This just holds pointers to a bunch of Tensors (that will mutate in-place)
Yeah, I realized this while driving today. These are just references, and the update happens in place.
Will remove it.
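A small check of the point settled above, that the flattened list holds references to the same Tensor objects, so in-place optimizer updates are visible without re-flattening (illustrative only, not part of the PR):

import torch
import torch.utils._pytree as pytree

lin = torch.nn.Linear(2, 2)
params_flat, _ = pytree.tree_flatten(dict(lin.named_parameters()))

# The flat list holds the same Tensor objects, not copies.
assert any(p is lin.weight for p in params_flat)

opt = torch.optim.SGD(lin.parameters(), lr=0.1)
lin(torch.randn(4, 2)).sum().backward()
before = lin.weight.detach().clone()
opt.step()  # updates lin.weight in place

w = next(p for p in params_flat if p is lin.weight)
assert not torch.equal(w, before)  # the flat list sees the in-place update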
functorch/_src/aot_autograd.py
Outdated
if compiled_f is None:
    fn_with_params_as_args = flattened_fn(gm, nargs)
    compiled_f = aot_function(fn_with_params_as_args, *top_args, **top_kwargs)
In the torchdynamo case (not in the general case), there should only ever be one compiled_f. You can just compute it ahead of time and don't need to wait until the first call.
Ok. Yeah, we can send example_inputs and compile it beforehand.
I think there is also a bunch of pytree overhead inside aot_function, plus an unneeded caching layer.
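A hedged sketch of the "compute it ahead of time" idea from this thread, assuming functorch's aot_function and its nop compiler; the wrapper class and names are illustrative, not the PR's code:

import torch
from functorch.compile import aot_function, nop

class AOTModuleEager(torch.nn.Module):
    def __init__(self, fn_with_params_as_args, params_flat):
        super().__init__()
        # Build the aot_function wrapper once at construction instead of
        # guarding on `if compiled_f is None` inside forward.
        self.compiled_f = aot_function(fn_with_params_as_args,
                                       fw_compiler=nop, bw_compiler=nop)
        self.params_flat = params_flat

    def forward(self, *args):
        # The flat parameter list is prepended to the real inputs.
        return self.compiled_f(*self.params_flat, *args)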
Closing. The following PRs cover this:
@jansel @Chillee
Inspired heavily by https://github.com/facebookresearch/torchdynamo/blob/ce0c84a62d3287e2afde22e0d823a8d1ae4758a8/torchdynamo/optimizations/python_key.py#L121