Fix local_fill_sink rewrite for multiple output Elemwise Ops #773


Merged

Conversation

ricardoV94
Member

Description

Reported by @tomicapretto in pymc-devs/pymc#7315 (review)

The rewrite implicitly assumed that Elemwise nodes have a single output, which is not true. The reported issue involved the gradient of BetaIncGrad, which includes a ScalarLoop Op with multiple outputs.

The changes get rid of the eager sink at the local node rewriter level. That approach was not actually working: the nested replacements referenced variables that were never part of the original fgraph, so those replacements were being ignored altogether. Instead, we wrap the rewrite in an in2out pass, which safely achieves the intended behavior.

The new test acts as a regression test for the bug: calling the old rewrite (or simply trying to compile the graph) would reproduce the reported issue when it encountered the multiple-output Op.
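As a toy illustration (not PyTensor's actual API — the class and function names here are hypothetical), the failure mode of a single-output assumption looks like this: a rewrite that indexes one output silently drops the rest on a multiple-output node.

```python
# Hypothetical minimal mock of the failure mode: the old rewrite effectively
# handled a single output, silently ignoring the rest on multiple-output
# Elemwise nodes (such as an Elemwise wrapping a ScalarLoop).
class FakeNode:
    def __init__(self, outputs):
        self.outputs = outputs

def old_style_result(node):
    # Implicit single-output assumption: only outputs[0] is considered.
    return [node.outputs[0]]

def fixed_style_result(node):
    # Handle every output of the node.
    return list(node.outputs)

node = FakeNode(outputs=["grad_0", "grad_1"])
assert old_style_result(node) == ["grad_0"]            # second output dropped
assert fixed_style_result(node) == ["grad_0", "grad_1"]
```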

Checklist

Type of change

  • New feature / enhancement
  • Bug fix
  • Documentation
  • Maintenance
  • Other (please specify):


codecov bot commented May 16, 2024

Codecov Report

Attention: Patch coverage is 82.35294% with 3 lines in your changes missing coverage. Please review.

Project coverage is 80.85%. Comparing base (d80c0bf) to head (7019ddf).
Report is 209 commits behind head on main.

Files Patch % Lines
pytensor/tensor/rewriting/basic.py 82.35% 1 Missing and 2 partials ⚠️
Additional details and impacted files

Impacted file tree graph

@@            Coverage Diff             @@
##             main     #773      +/-   ##
==========================================
- Coverage   80.85%   80.85%   -0.01%     
==========================================
  Files         162      162              
  Lines       47019    47016       -3     
  Branches    11504    11501       -3     
==========================================
- Hits        38018    38014       -4     
- Misses       6750     6751       +1     
  Partials     2251     2251              
Files Coverage Δ
pytensor/tensor/rewriting/basic.py 93.75% <82.35%> (-0.57%) ⬇️

... and 2 files with indirect coverage changes

@ricardoV94 ricardoV94 force-pushed the fix_local_fill_sink_multiple_outputs branch 2 times, most recently from ba17e8f to 381b88f Compare May 16, 2024 12:23
Comment on lines +337 to +342
a, b = inp.owner.inputs
if b.type.dtype != inp.dtype:
    # The input was implicitly casted by the fill operation
    b = b.cast(inp.dtype)
models.append(a)
inputs.append(b)
Member Author


This was also a potential source of bugs in the old rewrite. Ops may behave fundamentally differently if their input types change, so we shouldn't let that happen.
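A quick NumPy sketch of the dtype concern (assuming fill's output dtype follows the usual elemwise promotion of its two inputs, which is what makes the explicit b.cast(inp.dtype) above necessary):

```python
import numpy as np

# fill(a, b) is an Elemwise, so its output dtype follows elemwise promotion
# of a's and b's dtypes; NumPy's result_type mirrors that rule.
a_dtype = np.dtype("float64")
b_dtype = np.dtype("int8")
fill_dtype = np.result_type(a_dtype, b_dtype)
assert fill_dtype == np.dtype("float64")

# Sinking the fill pulls the raw b (still int8) past code that previously
# saw a float64 value — without the cast, the consuming Op's input type
# would silently change.
assert b_dtype != fill_dtype
```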

@ricardoV94 ricardoV94 force-pushed the fix_local_fill_sink_multiple_outputs branch from 381b88f to 7019ddf Compare May 16, 2024 12:26
@node_rewriter([Elemwise])
def local_fill_sink(fgraph, node):
    """
    f(fill(a, b), fill(c, d), e) -> fill(c, fill(a, f(b, d, e)))
    f need to be an elemwise that isn't a fill.
    """
-    if not hasattr(node, "op") or not isinstance(node.op, Elemwise) or node.op == fill:
+    if isinstance(node.op.scalar_op, Second):
Member Author


The extra checks were only needed for the recursive call of this rewrite. A normal call will never invoke it on a node without an Op, or on one whose Op is not already an Elemwise.

@ricardoV94
Member Author

For reference, fill is identical to np.broadcast_arrays(x, y)[1]; that is, it broadcasts y to the broadcast shape of x and y.
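This equivalence can be checked directly in NumPy, along with the sinking identity from the rewrite's docstring (f = addition here, chosen purely for illustration):

```python
import numpy as np

def fill(x, y):
    # fill(x, y) == np.broadcast_arrays(x, y)[1]: broadcast y to the
    # common broadcast shape of x and y.
    return np.broadcast_arrays(x, y)[1]

a = np.zeros((3, 1)); b = np.float64(2.0)
c = np.zeros((1, 4)); d = np.float64(3.0)
e = np.ones((3, 4))

# f(fill(a, b), fill(c, d), e) -> fill(c, fill(a, f(b, d, e))), with f = add
lhs = fill(a, b) + fill(c, d) + e
rhs = fill(c, fill(a, b + d + e))
assert lhs.shape == (3, 4)
assert np.array_equal(lhs, rhs)
```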


@tomicapretto tomicapretto left a comment


I'm not able to say anything about the changes here because I'm not familiar with the parts being modified. But I can say this indeed fixes the example I shared on the linked PR, so this is good.

@ricardoV94 ricardoV94 merged commit 8c157a2 into pymc-devs:main May 17, 2024
54 of 55 checks passed
Labels: bug (Something isn't working), graph rewriting

2 participants