Fix shared_ptr refcycle in graph executor #10222


Closed
wants to merge 1 commit

Conversation

zou3519 (Contributor) commented Aug 3, 2018

Fixes #10032

When capturing an output, GraphExecutorAutogradFunction creates a
SavedVariable with is_output=False and owns it:
https://github.com/pytorch/pytorch/blob/master/torch/csrc/jit/graph_executor.cpp#L87

Constructing a SavedVariable with is_output=False makes it own a copy of
the shared_ptr<GraphExecutorAutogradFunction>, which creates a reference
cycle through this line:
https://github.com/pytorch/pytorch/blob/6456b944fd3dfe1b7db830b27afd44b15ba5a6e9/torch/csrc/autograd/saved_variable.cpp#L27

grad_fn_ = variable.grad_fn();

The solution in this PR is to construct the SavedVariable with
is_output=True if the captured value is an output.

Test Plan

Turn on CUDA memory checking for JitTestCase. If a test's name
includes "cuda" or "gpu", the CUDA memory check runs for it.

cc @zdevito

(The commit message repeats the pull request description above.)
zou3519 added the oncall: jit (Add this issue/PR to JIT oncall triage queue) label Aug 3, 2018
facebook-github-bot (Contributor) commented:

zou3519 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

goodlux pushed a commit to goodlux/pytorch that referenced this pull request Aug 15, 2018
Summary: Fixes pytorch#10032. (The commit summary repeats the pull request description above.)

cc zdevito
Pull Request resolved: pytorch#10222

Reviewed By: ezyang

Differential Revision: D9162995

Pulled By: zou3519

fbshipit-source-id: aeace85a09160c7a7e79cf35f6ac61eac87cbf66
@ezyang ezyang added the merged label Jun 26, 2019
Labels
oncall: jit Add this issue/PR to JIT oncall triage queue

Successfully merging this pull request may close these issues.

3x regression in JIT LSTM speeds
5 participants