
Performance: prevent blowup in normalization #105


Merged
1 commit merged into bastikr:master on May 2, 2022

Conversation

olliemath (Contributor) commented on Jul 7, 2021

This addresses the case where we have an expression which is small when normalized, but in its current form contains large dual expressions. The blowup happens at the _rdistributive step, so the idea is to normalize subexpressions as much as possible before that. Closes #106
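
To make the strategy concrete, here is a minimal, self-contained sketch of the idea (not the actual patch): reduce every subexpression first, so the distributive step only ever combines operands that are already small, and absorb redundant clauses as you go. The tuple-based AST and the to_cnf helper are illustrative inventions, not boolean.py code, and negation is left out for brevity.

    from itertools import product

    def to_cnf(expr):
        # expr is ('sym', name), ('and', [subexprs]) or ('or', [subexprs]).
        # CNF is represented as a list of clauses, each a frozenset of literals.
        if expr[0] == 'sym':
            return [frozenset([expr[1]])]
        # Normalize every subexpression first, so the distribution below only
        # ever sees operands that are already reduced.
        children = [to_cnf(sub) for sub in expr[1]]
        if expr[0] == 'and':
            clauses = [clause for child in children for clause in child]
        else:  # 'or': distribute over the already-normalized children
            clauses = [frozenset().union(*combo) for combo in product(*children)]
        # Drop duplicate and absorbed clauses so intermediate results stay small.
        kept = []
        for clause in sorted(set(clauses), key=len):
            if not any(k <= clause for k in kept):
                kept.append(clause)
        return kept

    expr = ('and', [('sym', 'a'),
                    ('or', [('and', [('sym', 'b'), ('sym', 'c')]), ('sym', 'b')])])
    print(to_cnf(expr))  # two unit clauses {'a'} and {'b'}, i.e. a & b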

The current test case takes 0.4s with the new code, but took 500s with the old code. We have many such cases in our codebase, and the blow-up is exponential in the size of the original expression. We have tested this change against all formulas in our codebase and performance is now reasonable for all of them.

The original expression that drew our attention to this issue (below) now takes 30s (~3s under PyPy) with the new code; with the old code it never returns (in fact I run out of memory):

        a & (
            (b & c & d & e & f & g)
            | (c & d & f & g & i & j)
            | (c & f & g & h & i & j)
            | (c & f & g & i & j & k)
            | (c & d & f & g & i & n & p)
            | (c & e & f & g & i & m & x)
            | (c & e & f & g & l & o & w)
            | (c & e & f & g & q & s & t)
            | (c & f & g & i & m & n & r)
            | (c & f & g & l & n & o & r)
            | (c & d & f & g & i & l & o & u)
            | (c & e & f & g & i & p & y & ~v)
            | (c & f & g & i & j & z & ~(c & f & g & i & j & k))
            | (
                c & f & g & t
                & ~(b & c & d & e & f & g)
                & ~(c & d & f & g & i & j)
                & ~(c & f & g & h & i & j)
                & ~(c & f & g & i & j & k)
                & ~(c & d & f & g & i & n & p)
                & ~(c & e & f & g & i & m & x)
                & ~(c & e & f & g & l & o & w)
                & ~(c & e & f & g & q & s & t)
                & ~(c & f & g & i & m & n & r)
                & ~(c & f & g & l & n & o & r)
                & ~(c & d & f & g & i & l & o & u)
                & ~(c & e & f & g & i & p & y & ~v)
                & ~(c & f & g & i & j & z & ~(c & f & g & i & j & k))
            )
            | (
                c & f & g & ~t
                & ~(b & c & d & e & f & g)
                & ~(c & d & f & g & i & j)
                & ~(c & f & g & h & i & j)
                & ~(c & f & g & i & j & k)
                & ~(c & d & f & g & i & n & p)
                & ~(c & e & f & g & i & m & x)
                & ~(c & e & f & g & l & o & w)
                & ~(c & e & f & g & q & s & t)
                & ~(c & f & g & i & m & n & r)
                & ~(c & f & g & l & n & o & r)
                & ~(c & d & f & g & i & l & o & u)
                & ~(c & e & f & g & i & p & y & ~v)
                & ~(c & f & g & i & j & z & ~(c & f & g & i & j & k))
            )
        )
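
For anyone who wants to reproduce the timing, a rough sketch along these lines should do it. It assumes boolean.py's documented BooleanAlgebra.parse() entry point and treats algebra.normalize(expr, algebra.AND) as the CNF call; the exact normalization entry point may differ between boolean.py versions, and the formula is shortened here (substitute the full expression above).

    import time
    import boolean

    algebra = boolean.BooleanAlgebra()
    # Shortened stand-in; paste the full formula from above to reproduce the issue.
    expr = algebra.parse("a & ((b & c & d & e & f & g) | (c & d & f & g & i & j))")

    t0 = time.time()
    cnf = algebra.normalize(expr, algebra.AND)  # assumed CNF entry point
    t1 = time.time()
    print(cnf, t1 - t0)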

This addresses the case where we have an expression which is small
when normalized, but in its current form contains a large dual
expression.
assert str(cnf) == "a&c&f&g"
# Locally, this test takes 0.4s, previously it was 500s.
# We allow 30s because of the wide range of possible CPUs.
assert t1 - t0 < 30, "Normalizing took too long"


We shouldn't use runtime in an assertion as a general rule... it's brittle as you already indicated in the comment. You may want to explore using mocks and counting function calls. expr.simplify looks like a good candidate.
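
A deterministic version of that suggestion could count calls instead of seconds, roughly like the sketch below: wrap the simplify method of the parsed expression's class with a small counter and assert on the call count. Which method to wrap and what bound to use are assumptions; unittest.mock's patching would work just as well as the manual swap shown here.

    import functools
    import boolean

    def count_calls(func):
        # Wrap func and record how many times it runs.
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            wrapper.calls += 1
            return func(*args, **kwargs)
        wrapper.calls = 0
        return wrapper

    algebra = boolean.BooleanAlgebra()
    expr = algebra.parse("a & ((b & c) | b)")

    cls = type(expr)                      # the class whose simplify() we count
    original = cls.simplify
    cls.simplify = count_calls(original)  # temporary, test-local patch
    try:
        expr.simplify()
        calls = cls.simplify.calls
    finally:
        cls.simplify = original           # always restore the real method

    print(calls)
    # A regression test can then assert that `calls` stays below a fixed bound
    # instead of comparing wall-clock time across machines.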

Collaborator


@prodhype @olliemath should I go ahead and merge though?


@pombredanne It should be rewritten so that it doesn't depend on runtime before being merged.

Collaborator


@prodhype there you go. Thanks!

Collaborator


See 5f93c8b

Contributor Author


@pombredanne sorry it took me a while to get back to you, I've been a bit too busy for much open source recently. Thanks for improving the test though!

pombredanne added a commit that referenced this pull request May 2, 2022
Reference: #105 (comment)
Reported-by: @prodhype
Signed-off-by: Philippe Ombredanne <[email protected]>
pombredanne merged commit 1305cbf into bastikr:master on May 2, 2022
@pombredanne (Collaborator)

I am merging this in #107 ... so I am closing this here

@pombredanne (Collaborator)

Thanks!

Linked issue: Exponential blowup when normalizing