Rewind index to improve error recovery #292

stasm · 2018-10-15T22:26:06Z

The error recovery logic in the reference parser and the optimistic runtime parser is able to resume parsing at an offset which precedes the exact index of the error. Consider the following example:

foo = {
bar = Bar
     ^----- Expected token: "}"

The error occurs well into the bar… line. At the same time, I would expect bar to be a functional message and to parse correctly. Messages preceding any given message should not be allowed to break it.

This wasn't easy to achieve with the ParserStream architecture based on an iterator over the source string. I re-wrote it to use a simpler cursor-based approach. As it turned out, this simplified stream.js quite much and it also improved the performance :)

jsshell 63 (ms, compared to zeroseven):
  mean:   28.86 (-28%)
  stdev:  1.35
  sample: 30

node.js 9.11 (ms, compared to zeroseven):
  mean:   8.51 (-32%)
  stdev:  1.05
  sample: 30

Pike

As much as I understand the parser, this looks good to me. Simpler code is good, and also more consistency between stream API usage in parser and ftlstream.

I found one issue, where our error position is outside of our junk entry, and I don't think that's right. Good news, AFAICT, you have all the right information in getEntryOrJunk already.

And one stylistic nit.

The behavior tests are always a bit of black magic to me. They look better now than before (wth # key-05), but that's not a strong statement.

I'd love to cross-check the fixtures after the fix, so doing a request-changes instead of approve-with-comments.

Pike · 2018-10-16T09:27:24Z

fluent-syntax/test/fixtures_structure/unclosed.json

+          "span": {
+            "type": "Span",
+            "start": 16,
+            "end": 16


I think we should rewind the error position, too.

If we rewind the error position, pretty printing the annotations (or highlighting them in IDEs) would always point to the beginning of the line. Which I'll admit is sometimes good, and sometimes bad :) It's the choice between:

A) Error position unchanged. # This makes sense to me. ! E0003 on line 2: | foo = { | invalid placeable } … ^----- Expected token: "}" # Meh, this is kind of misleading. ! E0003 on line 2: | foo = { | bar = Bar … ^----- Expected token: "}"

And:

B) Error position rewound. # Meh, we could do better. ! E0003 on line 2: | foo = { | invalid placeable } … ^----- Expected token: "}" # This makes sense to me. ! E0003 on line 2: | foo = { | bar = Bar … ^----- Expected token: "}"

I take it that you vote for B, as it's likely more common?

Pike · 2018-10-16T09:55:11Z

fluent-syntax/src/stream.js

-
-    return ret === ch;
+    this.peekOffset++;
+    return this.currentPeek;


Just sugar, I'd love for next() and peek() to be symmetric in their implementation. I can see reasons for both ways, maybe perf can throw a dice?

I see what you mean. There's no difference in the perf benchmarks, but I like the idea of making the implementation symmetric. Also, I think I'll ditch the pre-increment operator in next(). It's easy to miss when reading the code.

Pike

r=me, not feeling strongly about the const nit, I'm not zibi ;-)

Pike · 2018-10-16T11:32:38Z

fluent-syntax/src/parser.js

      ps.skipToNextEntryStart(entryStartPos);
-      const nextEntryStart = ps.index;
+      let nextEntryStart = ps.index;


Nit, this can still be a const, right?

Ah, yes. I stopped using const in most of the code I write these days and only use them for actual constants. I typed let out of habit. I'll revert to const for consistency with the rest of the file.

stasm requested a review from zbraniecki October 15, 2018 22:26

stasm force-pushed the rewind branch from 7f38b53 to 57ded07 Compare October 15, 2018 22:39

Pike suggested changes Oct 16, 2018

View reviewed changes

stasm force-pushed the rewind branch from d26db2b to 65b7299 Compare October 16, 2018 11:27

Pike approved these changes Oct 16, 2018

View reviewed changes

stasm force-pushed the rewind branch from d0b01d1 to 9a377ac Compare October 16, 2018 11:41

stasm added 2 commits October 16, 2018 13:51

Use a cursor in ParserStream

5c1c446

Rewind index to improve error recovery

ca17a53

stasm force-pushed the rewind branch from 9a377ac to ca17a53 Compare October 16, 2018 11:51

stasm merged commit bcf6799 into projectfluent:zeroseven Oct 16, 2018

stasm deleted the rewind branch October 16, 2018 12:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rewind index to improve error recovery #292

Rewind index to improve error recovery #292

Uh oh!

stasm commented Oct 15, 2018

Uh oh!

Pike left a comment

Uh oh!

Pike Oct 16, 2018

Uh oh!

stasm Oct 16, 2018

Uh oh!

Pike Oct 16, 2018

Uh oh!

stasm Oct 16, 2018

Uh oh!

Pike left a comment

Uh oh!

Pike Oct 16, 2018

Uh oh!

stasm Oct 16, 2018

Uh oh!

Uh oh!

Rewind index to improve error recovery #292

Rewind index to improve error recovery #292

Uh oh!

Conversation

stasm commented Oct 15, 2018

Uh oh!

Pike left a comment

Choose a reason for hiding this comment

Uh oh!

Pike Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

stasm Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

Pike Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

stasm Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

Pike left a comment

Choose a reason for hiding this comment

Uh oh!

Pike Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

stasm Oct 16, 2018

Choose a reason for hiding this comment

Uh oh!

Uh oh!