
Range performance fixes #9


Merged: 5 commits into openannotation:master on May 3, 2015
Conversation

@wjt (Contributor) commented Apr 30, 2015

I'm using Annotator on pages with really large tables, highlighting potentially thousands of rows of text. Selecting any amount of text which spans more than one row was extremely expensive. NormalizedRange#limit and #getTextNodes were the main culprits according to Chrome's profiler. These patches make them an order of magnitude faster for me.

Happy to write more tests if you're not convinced that this new logic is right! Also, the existing code is inconsistent in whether it uses $.contains or util.contains; shout if there's some hidden reason to use one or the other that I've not seen.

Agh, while writing this pull request I've noticed that I introduced a use of childNodes in #limit -- based on the comment in #getTextNodes I guess I need to remove that for IE9's sake.

@@ -189,13 +189,33 @@ class Range.NormalizedRange
   #
   # Returns updated self or null.
   limit: (bounds) ->
-    nodes = $.grep this.textNodes(), (node) ->
-      node.parentNode == bounds or $.contains(bounds, node.parentNode)
+    if @commonAncestor == bounds or $.contains(bounds, @commonAncestor)
Member review comment:

I would make this === for clarity.

@wjt (Contributor, author) replied:

I've not written CoffeeScript before yesterday, but the docs suggest that == in CoffeeScript compiles to === in JS.
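The docs are right on this point: CoffeeScript's `==` compiles to JavaScript's strict `===`, so the loose, coercing comparison is never reachable from CoffeeScript source. A minimal JavaScript illustration of the difference:

```javascript
// JavaScript has two equality operators. CoffeeScript's `==` always
// compiles to the strict `===` below -- the loose, coercing `==`
// does not appear in compiled CoffeeScript output.
const loose = (1 == '1');   // true: `==` coerces the string to a number
const strict = (1 === '1'); // false: `===` compares type and value
```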

@tilgovi (Member) commented May 1, 2015 via email

@wjt (Contributor, author) commented May 1, 2015

To no-one's great surprise, using TreeWalker is a bit faster (or no slower) and more concise. I didn't know about it, thanks! I also found a bug in my open-coded implementation of #limit – the tests now cover that.

@wjt commented May 1, 2015

(Let me know if you would like the intermediate patches squashed away.)

@tilgovi (Member) commented May 3, 2015

That's excellent. Squash away and I'll be glad to merge it!

@tilgovi commented May 3, 2015

If you could also remove the flatten function, which is no longer used, that would be excellent.

wjt added 5 commits May 3, 2015 10:03
Previously, only the left-hand side was tested; and grandchildren of the
bounds were not tested at all. With [] indicating a text node:

    <h1>[My Heading]</h1>
    <p>
      [My paragraph]
      <span>
        [ conti]
        [nues]
      </span>
    </p>
    <p>[Another paragraph begins]</p>

I wrote an implementation where, given a range ending in the second
paragraph, limiting it to the first paragraph would incorrectly update
the `@end` to ` conti` not `nues`. The old test case did not catch it.
Traversing the whole tree is unnecessary -- and costly if the
@commonAncestor is <body> and you have many nodes in your document.

All we actually need to do is update @start and @end to point at the
first and last text nodes in the bounds, if they currently fall outside
it.

* master: 1.61s ±2.26% (6 runs sampled) -- but this is a little unfair
  since it was implemented in terms of a very slow getTextNodes call
* this implementation: 0.01ms ±2.91% (85 runs sampled)
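The approach this commit describes can be sketched in plain JavaScript. This is a simplified model, not Annotator's actual code: `range` and `isInside` are hypothetical stand-ins for the real DOM objects and containment test.

```javascript
// Simplified model of the fast #limit described above: rather than
// filtering every text node under the range, only move start/end
// inward to the first and last text nodes that lie inside the bounds.
function limit(range, isInside) {
  const nodes = range.textNodes;                     // ordered text nodes
  const first = nodes.find(isInside);                // first node in bounds
  if (first === undefined) return null;              // range wholly outside
  const last = [...nodes].reverse().find(isInside);  // last node in bounds
  return { ...range, start: first, end: last };
}

// Example: four text nodes, only the middle two fall inside the bounds.
const range = { textNodes: ['a', 'b', 'c', 'd'], start: 'a', end: 'd' };
const inBounds = (n) => n === 'b' || n === 'c';
const limited = limit(range, inBounds);
// limited.start === 'b', limited.end === 'c'
```

The key property is that only the ordered list of text nodes is consulted, never the whole subtree under the common ancestor, which is where the old cost came from.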
The old implementation was really expensive on a range covering many
rows of table. I think it's because an array was being allocated at
every level of the tree, then flattened out into the level above.

Instead here we just do a depth-first search for text nodes and build up
a single array as we go.  In a little benchmark with a range covering
every cell of a 5000×3 table:

* master: 1.57s ±1.14% (6 runs sampled)
* TreeWalker: 17.5ms ±0.74% (72 runs sampled)
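The single-array depth-first collection this commit describes can be sketched as follows. Since TreeWalker only exists in the browser, plain objects stand in for DOM nodes here (text nodes carry `data`, element nodes carry `childNodes`); the names are illustrative, not Annotator's API.

```javascript
// Depth-first collection of text nodes into one shared array --
// no per-level arrays, no flattening step.
function getTextNodes(node, out = []) {
  if ('data' in node) {
    out.push(node);                 // leaf: a text node
  } else {
    for (const child of node.childNodes) {
      getTextNodes(child, out);     // recurse, reusing the same array
    }
  }
  return out;
}

// Roughly <p>My paragraph <span>conti nues</span></p>:
const tree = {
  childNodes: [
    { data: 'My paragraph ' },
    { childNodes: [{ data: 'conti' }, { data: 'nues' }] },
  ],
};
const text = getTextNodes(tree).map((n) => n.data).join('');
// text === 'My paragraph continues'
```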
Per openannotation/annotator#527 . We actually don't need to use global
at all -- we can get the document from the element we're about to walk.
@wjt force-pushed the range-performance branch from 6a37aac to 90c2209 on May 3, 2015 09:10
@wjt commented May 3, 2015

How's that?

@tilgovi commented May 3, 2015

Looks great :)

tilgovi added a commit that referenced this pull request May 3, 2015
@tilgovi tilgovi merged commit 062cd91 into openannotation:master May 3, 2015
@mwidner commented Nov 17, 2015

@tilgovi @wjt How challenging do you think it'd be to backport this to 1.2.x? I'm seeing the same performance issue there and can't upgrade to 2.0 yet. Thanks.

@tilgovi (Member) commented Nov 17, 2015

@mwidner there's a v1.2.x branch on annotator so we have a place to backport fixes and make maintenance releases. Do you think you could extract this into a PR there?

@tilgovi commented Nov 17, 2015

It shouldn't be too hard because when this change happened the files were still coffee and still had the same names and paths, so it might be as simple as taking a copy of the annotator repo, branching off v1.2.x, adding this repo as a remote and cherry-picking these changes.

@mwidner commented Nov 17, 2015

Done! Thanks for the pointers.

@wjt wjt deleted the range-performance branch May 7, 2017 09:50
@wjt wjt restored the range-performance branch May 7, 2017 09:50
@wjt wjt deleted the range-performance branch January 19, 2019 21:53