
Avoid performing ** operations on values greater than 1e5 #1610


Merged · 1 commit · Jun 21, 2022

Conversation

@jacobtylerwalls (Member)

Description

Avoid actually calculating ** operations on values greater than 1e5.

Type of Changes

βœ“ πŸ”¨ Refactoring

Related Issue

Closes pylint-dev/pylint#6745
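The change described in the title amounts to a small guard before constant-folding a `**` expression. A minimal sketch of the idea, assuming a hypothetical helper (`safe_pow` and `HUGE` are illustrative names, not astroid's actual API):

```python
# Illustrative sketch of the guard in this PR: before folding a ** node,
# bail out when an operand is large enough that computing the result
# could hang the linter. Names here are hypothetical, not astroid's API.

HUGE = 1e5  # threshold from the PR title: values greater than 1e5

def safe_pow(left, right):
    """Return left ** right, or None (treated as 'uninferable') when an
    operand is so large that the exponentiation could blow up."""
    for operand in (left, right):
        if isinstance(operand, (int, float)) and abs(operand) > HUGE:
            return None  # give up instead of building a gigantic integer
    return left ** right
```

In astroid itself the "give up" path would yield its `Uninferable` sentinel rather than `None`; the point is only that the check costs two comparisons, while the computation it skips can take arbitrarily long.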

@coveralls

Pull Request Test Coverage Report for Build 2468595687

  • 3 of 3 (100.0%) changed or added relevant lines in 1 file are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage increased (+0.002%) to 92.221%

Totals Coverage Status
  • Change from base Build 2468320632: +0.002%
  • Covered Lines: 9342
  • Relevant Lines: 10130

πŸ’› - Coveralls

@Pierre-Sassoulas (Member) left a comment


Thank you for working on this! I thought about that issue a little, and I was wondering about a timeout with a really small threshold (0.01s?) and the possibility to relaunch with a flag if you really want the result. Knowing in advance that something will take a long time to calculate is an NP problem; seeing that it took longer than x is a P problem. How long does 9999 ** 9999 take, for example? How long does any given mathematical calculation actually take? I don't think we want to add a bunch of conditions like that for each possible formula. What do you think?

@jacobtylerwalls (Member, Author)

I don't think we want to add a bunch of conditions like that for each possible formula.

Oh I agree, I just didn't think we were likely to run into any additional cases. Exponents seemed like the only case that could blow up.

I was wondering about a time out with a really small threshold (0.01s?)

Interesting. I'm not aware of a way to interrupt a call like this; do you have a suggestion?

@Pierre-Sassoulas (Member)

Exponents seemed like the only case that could blow up.

A lot of things have a tendency to surprise us when pylint is run on a lot of unexpected code πŸ˜„ I don't have an example of a calculation that would take a long time, but I think it will always be possible to construct one. Maybe it's time to open a cops-and-robbers question on the Code Golf Stack Exchange, with one team trying to catch pathological cases and the other trying to make the inference slow πŸ˜„! But maybe it's enough to handle this case only.

I'm not aware of a means to interrupt a call like this, do you have a suggestion?

Actually, doing it cross-platform and multiprocess-compatible is not that easy. (I was thinking of signal for that, but I checked and it turns out it only works on Unix.) Do we need to handle multiprocessing for that? It's possible that we're always inferring on a single thread. We have a max_inference option where we just return Uninferable if we recurse more than 100 times (or 500, I don't remember). Having a time limit for inference would be somewhat similar and "solve" all pathological cases at once (we had some over the years, especially with pandas/numpy's genuinely large recursion).
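The signal-based approach mentioned above could look roughly like the following. This is a sketch only, and as noted it is Unix-only (`signal.alarm` does not exist on Windows, and SIGALRM handlers can only be installed in the main thread):

```python
# Unix-only sketch of a signal-based inference timeout. signal.alarm is
# unavailable on Windows, which is exactly the portability problem raised
# in this thread. Must run in the main thread.
import signal
from contextlib import contextmanager

@contextmanager
def time_limit(seconds):
    """Raise TimeoutError if the wrapped block runs longer than `seconds`."""
    def _raise(signum, frame):
        raise TimeoutError("inference took too long")

    old_handler = signal.signal(signal.SIGALRM, _raise)
    signal.alarm(seconds)  # whole seconds only; signal.setitimer allows fractions
    try:
        yield
    finally:
        signal.alarm(0)  # cancel any pending alarm
        signal.signal(signal.SIGALRM, old_handler)
```

Usage would be something like `with time_limit(1): result = infer(node)`, falling back to an uninferable sentinel when TimeoutError is raised.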

@jacobtylerwalls (Member, Author)

Cool, I didn't know about signal, but yeah, signal.alarm is not available on Windows.

Problems like this are fun, but, in the interest of expediency, I guess I was thinking a one-off special case would be enough on a "best efforts" basis.

FWIW, 1e4 values seem to ** quickly; it's just 1e5 values that start chewing up resources. And ** on float operands short-circuits straight to OverflowError.
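The asymmetry described here is easy to demonstrate (assuming CPython): float exponentiation fails fast because the result exceeds the float range, while integer exponentiation always "succeeds" by building an arbitrary-precision result, which is what chews up time and memory for large operands.

```python
# Float ** overflows immediately instead of computing anything huge.
try:
    1e5 ** 1e5
    print("no overflow")
except OverflowError:
    print("float ** raises OverflowError right away")

# Int ** builds an exact big integer; 9999 ** 9999 is still quick, but the
# result is already over 100,000 bits wide, and it only gets worse from there.
# (bit_length avoids str(), which Python 3.11+ limits for huge ints.)
bits = (9999 ** 9999).bit_length()
print("9999 ** 9999 fits in", bits, "bits")
```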

@Pierre-Sassoulas (Member)

@cdce8p or @hippo91, do you have an opinion on a timeout option for expensive inference calls? To sum up the discussion: the idea would be to cut off inference that takes longer than a time threshold, just as we currently cut off inference that recurses above a threshold of calls.

@jacobtylerwalls (Member, Author)

the idea would be to cut inference taking too long after a threshold time

I love the idea, but are we aware of a way to accomplish this on Windows?

@Pierre-Sassoulas (Member)

It seems you need to use multiprocessing or threads, so it's harder than I thought it would be with signal, but not impossible. (I did not search thoroughly for a Windows solution, as this fix could be good enough; maybe we don't need the timeout, depending on what astroid's experts think πŸ˜„.)
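For reference, one cross-platform possibility along the lines sketched above: since a Python thread cannot be forcibly interrupted, the usual portable approach runs the computation in a worker process and terminates it on timeout. This is only a sketch (`pow_with_timeout` is a hypothetical helper, not anything in astroid), and it glosses over the real costs this thread worries about: process startup, pickling, and start-method differences across platforms.

```python
# Cross-platform sketch of a process-based timeout: run the computation in a
# child process, wait with a deadline, and kill the child if it overruns.
import multiprocessing as mp

def _worker(base, exp, conn):
    # Runs in the child process; sends the result back through a pipe.
    conn.send(base ** exp)
    conn.close()

def pow_with_timeout(base, exp, seconds):
    """Return base ** exp, or None if it does not finish within `seconds`."""
    parent_end, child_end = mp.Pipe(duplex=False)
    proc = mp.Process(target=_worker, args=(base, exp, child_end))
    proc.start()
    proc.join(seconds)
    if proc.is_alive():      # still computing: kill it and give up
        proc.terminate()
        proc.join()
        return None
    return parent_end.recv() if parent_end.poll() else None

if __name__ == "__main__":
    print(pow_with_timeout(2, 10, 1.0))  # 1024
```

Note that `terminate()` abandons work rather than cancelling it gracefully, and very large results can stall on the pipe, which hints at why the maintainers below are wary of the added complexity.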

@hippo91 (Contributor)

hippo91 commented Jun 10, 2022

@Pierre-Sassoulas I think it is a good idea to limit the time spent in the inference process, but I would like to hear @cdce8p's opinion on this...

@cdce8p (Member)

cdce8p commented Jun 21, 2022

@cdce8p or @hippo91, do you have an opinion on a timeout option for expensive inference calls? To sum up the discussion: the idea would be to cut off inference that takes longer than a time threshold, just as we currently cut off inference that recurses above a threshold of calls.

I'm skeptical that it's truly worth the effort. My understanding is that canceling recursive calls above a threshold is comparatively easy. Any timeout-based solution adds extra complexity that needs to be maintained, even if we can avoid threading/multiprocessing. Additionally, it will make debugging more difficult and might even lead to non-deterministic results, which in itself can be a nightmare. In the end we probably have to ask ourselves whether it's worth it, or whether there are better/easier alternatives.

For instance, I do like this PR. It will provide a speedup in some edge cases without adding much complexity.


Successfully merging this pull request may close these issues.

Slow performance with large numbers
6 participants