~1.32x Faster checking with CleanDocument regex #105


Merged 1 commit into main on Nov 4, 2021

Conversation

schneems (Collaborator) commented Nov 4, 2021


## Perf difference

Benchmarking with `ruby_buildpack.rb.txt`:

```
Before:   0.195756   0.005201   0.200957 (  0.202021)
After:    0.148460   0.003634   0.152094 (  0.152692)
```
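The four columns above follow the output format of Ruby's `Benchmark` module: user CPU time, system CPU time, their total, and (in parentheses) real wall-clock time. A minimal sketch of producing numbers in that shape, with a hypothetical workload since the PR's actual benchmark script is not shown here:

```ruby
require "benchmark"

# Benchmark.measure returns a Benchmark::Tms whose to_s renders the same
# four columns as the table above: user, system, total, (real).
tms = Benchmark.measure do
  10_000.times { "a" * 100 } # hypothetical stand-in workload
end
puts tms
```

Any block can be measured this way; the PR's timings were taken against parsing `ruby_buildpack.rb.txt`.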

## Profile code

To generate profiler output, run:

```
$ DEBUG_PERF=1 bundle exec rspec spec/integration/dead_end_spec.rb
```

See the README for more details. Run it against the commit before this one and against this one to see the difference.

Before sha: 58a8d74
After sha: 0b4daca74fab5dc2979813461c0f2649951185e5

## How I found the issue

I was using `ruby-prof` to look for perf opportunities when I saw this output from the `CallStackPrinter`:

![](https://www.dropbox.com/s/7m6wi502fqoel2g/Screen%20Shot%202021-11-03%20at%204.03.10%20PM.png?raw=1)

I noticed a lot of time spent in `CleanDocument`, which was curious as it's an N=1 process that happens before we do any looping or iteration.

I also saw output from the `RubyProf::GraphHtmlPrinter` showing that we were creating a very large number of `LexValue` objects.

## The fix

The `CleanDocument` class removes lines that contain a comment. It first runs the lexer over the document, then uses the lex output to determine which lines hold comments it can remove.

Once the comments are removed, the lexer is rerun, because removing comments can change the lex output (specifically `:on_ignored_nl`); see PR #78 for more info.
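The re-lex requirement can be seen directly with `Ripper` (a sketch; the `CleanDocument` internals are not shown here): a comment line lexes as `:on_comment`, but once that line is blanked, the same spot lexes as `:on_ignored_nl`, so the first lex pass no longer describes the cleaned document.

```ruby
require "ripper"

# A comment line produces an :on_comment token...
with_comment = Ripper.lex("foo\n# bar\n").map { |token| token[1] }
p with_comment.include?(:on_comment) # => true

# ...but after blanking the comment line, the blank line's newline
# lexes as :on_ignored_nl instead, which changes downstream decisions.
without_comment = Ripper.lex("foo\n\n").map { |token| token[1] }
p without_comment.include?(:on_ignored_nl) # => true
```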

Using lex data to remove comments works, but it is brute force: it means we must lex the document twice, which is expensive and generates many `LexValue` objects.

Instead of removing lines based on lex values, we can remove lines based on regex. 

One caveat is that a regex can't tell whether a line is inside a heredoc. If a line looks like it could be a string interpolation inside a heredoc, like:

```ruby
source <<~EOM
  #{here} <==== Looks like a comment when viewed without context
EOM
```

then we don't remove the line, as it's doubtful someone would write a comment with a `#{` pattern in a place we care about (i.e., between chained method calls) like:

```ruby
User
  #{ this would be weird
  .first
```

With this approach, we reduce lexing passes from N=2 to N=1.
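A minimal sketch of the regex idea (a hypothetical helper; the actual `CleanDocument` implementation differs): blank out lines whose first non-whitespace character is a `#` not followed by `{`, so possible heredoc interpolations like `#{here}` survive and line numbering stays stable.

```ruby
# Hypothetical sketch of regex-based comment removal; the real
# CleanDocument logic is more involved.
def strip_comment_lines(source)
  source.lines.map do |line|
    # A `#` as the first non-whitespace character marks a comment line,
    # unless it begins `#{` (a possible heredoc interpolation).
    if line.match?(/\A\s*#(?!\{)/)
      "\n" # blank the line rather than delete it, to keep line numbers stable
    else
      line
    end
  end.join
end

source = <<~'SRC'
  # a full-line comment
  puts "hello"
  #{maybe_a_heredoc_embed}
SRC

puts strip_comment_lines(source)
```

With the negative lookahead, the `#{maybe_a_heredoc_embed}` line is left untouched while the full-line comment becomes a blank line.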

## Profiler thoughts

As a side note, this didn't show up as a hotspot in the perf tools. Looking back at the screenshots, it's not obvious that there was a problem here, or that I could get a ~1.3x speed boost from this change. The only reason I investigated is that I knew the code well and believed it shouldn't be spending much time here. The amount reported isn't outrageously high; it was just surprising to me.

Even running the profiler before and after, it's not clear it got faster based on the output:

![](https://www.dropbox.com/s/j8t7xd0i5unyim3/Screen%20Shot%202021-11-03%20at%204.11.55%20PM.png?raw=1)

(Before is top & after is bottom)

At the top you can see the total went from 1.1264283200034697 before to 0.8452127130003646 after. It's not clear why the output for `CleanDocument` shows 17% after when it was only 12% before.

Side by side, it makes a bit more sense: the problem dropped down the list in relative priority. However, I'm still not entirely clear on what the numbers indicate:

![](https://www.dropbox.com/s/jgxitymh2wtoxpr/Screen%20Shot%202021-11-03%20at%204.30.08%20PM.png?raw=1)

(Before left, after right)

You can see the HTML results for yourself: https://www.dropbox.com/sh/kueagukss7ho2rm/AABk3V8FQXaFWugKI21ADZJ0a?dl=0
@schneems changed the title from "~1.32x Faster CleanDocument" to "~1.32x Faster checking with CleanDocument regex" on Nov 4, 2021
@schneems force-pushed the schneems/quicker-picker-upper branch from 539ac0f to c088263 on Nov 4, 2021
@schneems merged commit 5c0c3a7 into main on Nov 4, 2021
@schneems deleted the schneems/quicker-picker-upper branch on Nov 4, 2021
schneems (Collaborator, Author) commented Nov 4, 2021

Other perf PRs:
