Compiling MessageContext #71

spookylukey · 2018-07-27T12:17:52Z

This is a second implementation of MessageContext using a compile-to-python strategy. The PR builds on top of #67 and should be reviewed after that has been merged.

My idea is that both implementations will be available, with the compiler the default because it is much faster (some benchmarks included in the new benchmark script). The interpreter is much easier to add new things to (which could be important for other contributors) and has some features/behaviour that the compiler doesn't have (documented). In addition, the two implementations can help test each other to some extent, and if you have any planned extensions, having two implementations, although twice the work, can help to ensure that the new feature doesn't limit you to a specific implementation strategy.
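As a rough illustration of the two strategies, here is a toy sketch. It is not the code in this PR: the message representation (a list of literal strings and `("var", name)` tuples) and the names `interpret` and `compile_message` are assumptions made up for the example. It shows an interpreter that walks the structure on every call, a compiler that generates Python source once, and the two being checked against each other.

```python
# Toy illustration only: a message is a list of parts, where a str is
# literal text and a ("var", name) tuple refers to an argument.
# This is NOT the Fluent AST, nor the code generated by this PR.

def interpret(parts, args):
    """Interpreter strategy: walk the message structure on every call."""
    out = []
    for part in parts:
        if isinstance(part, str):
            out.append(part)
        else:
            out.append(str(args[part[1]]))
    return "".join(out)


def compile_message(name, parts):
    """Compiler strategy: generate Python source once, then exec it.

    `name` must be a valid Python identifier in this sketch.
    """
    if all(isinstance(p, str) for p in parts):
        # Simplest case: the whole message collapses to a constant.
        body = "return " + repr("".join(parts))
    else:
        pieces = [
            repr(p) if isinstance(p, str) else "str(args[%r])" % p[1]
            for p in parts
        ]
        body = "return ''.join([" + ", ".join(pieces) + "])"
    source = "def %s(args):\n    %s\n" % (name, body)
    namespace = {}
    exec(source, namespace)
    return namespace[name], source


greeting = ["Hello, ", ("var", "user"), "!"]
compiled, source = compile_message("greeting", greeting)

# The two implementations can be used to test each other.
assert compiled({"user": "Ana"}) == interpret(greeting, {"user": "Ana"})
```

The work of deciding what each part means is done once, at compile time, so the generated function has nothing left to dispatch on at call time.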

For example, I have an escapers feature that I need for django-ftl - see docs, which I've been able to implement for both the interpreter and compiler, but starting with the interpreter was easier. (That branch hasn't been updated for the 0.6 spec changes yet.)

This branch is mostly complete, but probably needs some final cleanup. I'm opening it now in order to make @stasm aware of its existence (especially as I'm away for a bit now). It doesn't change that much from the interpreter MessageContext already implemented, but does clean up and clarify a few things (e.g. the error handling strategy), and MessageContext gains a check_message method (currently undocumented).

If you want to look at it, I imagine that starting with reading the tests/test_compiler.py tests will be the best way to go - it will illustrate the whole strategy, and the kind of optimizations that are implemented, which should explain a lot of the compiler.py and codegen.py code.

Pike · 2018-08-02T11:01:48Z

Thanks for sharing this.

I see that you mention profiling on your README, can you share what's showing up in the profiles for you? Asking 'cause I've done so in my tooling environment, and I found the Fluent parser to be pretty bad, compared to other l10n fileformats we check for Firefox.

My take from the profile I had was that this is mostly because the Python interpreter reacts to the code differently than the JS engines (which JIT). I personally expect that we'll want to have a Python parser which is written to perform well on Python in the not-so-distant future.

On the actual change here, I share the concerns about security that you added to the README, and I think we'll need strong answers.

spookylukey · 2018-08-02T19:50:52Z

@Pike
None of the profiling I've done has looked at the parser at all. For my use case (generating translations in a Django app, see django-ftl), it's of very little interest, because the parsing step (and the compiling-to-Python step added in this branch) are one-time costs that are paid at application startup.

So all of my benchmarks so far have focussed on the runtime performance of the compiled code. In many cases, the code is extremely minimal (e.g. returning a constant for the simple case - https://github.com/projectfluent/python-fluent/pull/71/files#diff-8ea7f46014c70a54a5d42179849a3a5aR56). For more complex cases, it is dominated by things like number formatting, or the plural rules lookup (which uses Babel's implementation and can be quite expensive for complicated rules).
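To make the distinction between one-time and per-call cost concrete, here is a minimal sketch using `timeit` from the standard library. `build_message` is a hypothetical stand-in for the parse-and-compile step, not a function from this branch, and the numbers it prints will vary by machine.

```python
import timeit


def build_message():
    """Stand-in for the one-time parse/compile step done at startup."""
    source = "def message(args):\n    return 'Settings'\n"
    namespace = {}
    exec(source, namespace)
    return namespace["message"]


def measure(number=10000):
    """Return (seconds per build, seconds per call) as rough averages."""
    message = build_message()
    startup = timeit.timeit(build_message, number=100) / 100
    per_call = timeit.timeit(lambda: message({}), number=number) / number
    return startup, per_call


startup, per_call = measure()
print("one-time build: %.2e s, per call: %.2e s" % (startup, per_call))
```

Only the second figure matters for a long-running application, which is why the benchmarks here look at the compiled code and not the parser.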

The benchmark script I've added could be extended to do benchmarks of the parser though.

zbraniecki · 2018-08-20T17:27:49Z

@spookylukey - is your patchset ready for review at this point?

spookylukey · 2018-08-20T18:35:48Z

@zbraniecki - it builds on top of #67, which is not yet merged or reviewed. Otherwise I think it is ready, but it might make more sense to wait for #67, or at least to compare with the final commit on that branch instead of with current master.

I've also been working on elm-fluent over the past few weeks, which is an FTL-to-Elm compiler implemented in Python, and so shares a large amount of structure with this compiler. Some of the work I've done on that has made me think of clean ups and improvements I could do for this branch, but they can wait.

zbraniecki · 2018-08-20T18:40:03Z

Thanks! I take it that #67 is ready for review then?

spookylukey · 2018-08-20T18:46:03Z

@zbraniecki Yes, it's ready, I fixed the last issue with the test suite failing on Python 2.

zbraniecki · 2018-08-20T19:02:17Z

Thank you! We'll coordinate and try to get it on the review schedule as soon as possible!

Motivation:

1. Simplicity - makes it easier for us to type the return values of these functions, because they no longer return tuples.
2. Performance - less tuple packing/unpacking. This will probably only make a difference when messages call other messages, because we still have to pack the tuple for the outer MessageContext.format call.
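A small sketch of the difference described above. All function names here are hypothetical, and passing a shared `errors` list is an assumption about how errors could be collected once the tuple is gone; the generated code in this branch may differ.

```python
# Before: every message function returns a (value, errors) tuple, so a
# message that calls another message must unpack the result each time.
def brand_tuple(args):
    return "Firefox", []


def about_tuple(args):
    brand_value, errors = brand_tuple(args)
    return "About " + brand_value, errors


# After: functions return only the value and append any errors to a
# shared list (assumed mechanism), so nested calls need no unpacking.
def brand(args, errors):
    return "Firefox"


def about(args, errors):
    return "About " + brand(args, errors)


def format_message(func, args):
    """Outer entry point: the (value, errors) tuple is packed once here."""
    errors = []
    return func(args, errors), errors
```

With this shape, every inner function has a plain string return type, and the tuple appears only at the outer boundary.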

spookylukey · 2019-02-02T10:40:52Z

This PR is obsolete. My latest branch is https://github.com/django-ftl/python-fluent/tree/compiler_implementation, which has many changes from this one. I will start a new PR for it later.

spookylukey added 30 commits May 11, 2018 21:06

Initial implementation of MessageContext

a540993

Beginnings of implementing resolve

3059ec2

Beginnings of resolving external arguments

ea68596

Fixed term/message mixup

97da936

format: Implemented attribute lookup

5fd424b

and other bug fixes

format: another test

f76c965

MessageContext: removed/changed methods that exposed implementation details

b280a9d

Avoid name clash with Python builtin ReferenceError

b5d0a2f

format: Tests for missing attributes

b78f33f

format: support for accessing attributes directly

c77ff6f

format: initial support for variant forms

89ef658

format: implemented select expressions

8d879ea

format: select expression with numbers

cef0734

format: implemented function calls

5cb1066

format: improved handling of numbers

a011eae

utils: added 'cachedproperty' decorator

d38d932

format: implemented plural rule forms, plus consistent handling of numbers

fea2742

Thanks Babel!

resolver: doc string plus better argument order

8c1b02a

format: handle named/keyword arguments to functions

84fca75

format: support for Term

5f8d8ec

format: fixed handling of missing messages/terms

5c36e5c

format: bulked out some tests

0e30a76

format: Bulked out tests for numbers

fc7ad8b

format: handling floating point numbers

0188bea

fluent: report missing variants

104da18

format: implemented NUMBER builtin, with partial application

bbde8e5

format: test addition

e9c9e66

MessageContext: added convenience add_messages_from_file

edf121b

format: made 'args' optional.

874c999

format: there is no need to support bytestrings, we keep them out at the start

87fc3c3

spookylukey added 8 commits July 26, 2018 15:45

compiler: fixes for v0.6 spec and AST

f139ffb

compiler: inline terms and variants

363e4e0

compiler: future proofing some codegen

c771d91

compiler: cleaned up some unused codgen operators

3e67f20

Fixed failing tests on Python 2.7

dc6406a

dedent_ftl was converting unicode strings to bytestrings

Fixed unused import - flake8 warning

f0f719c

Merge branch 'implement_format' into compiling_message_context

6797d4f

Post merge fixup

cc25a6a

spookylukey mentioned this pull request Sep 10, 2018

Implemented escaping mechanism #75

Closed

spookylukey added 4 commits September 15, 2018 20:32

compiler: removed unnecessary temporary variable for message calls.

b1ceda9

compiler: removed some unused functionality

b3fb776

Specifically, multi-assignments using tuple unpacking syntax

Fixed CompilingMessageContext.check_messages to always include compile errors

f84038a

spookylukey mentioned this pull request Oct 30, 2018

Implement MessageContext.format #67

Merged

spookylukey added 6 commits October 30, 2018 22:03

Merge branch 'master' into implement_format

e2cb337

Fixes for Fluent syntax 0.7

e59724e

Better clarity in README regarding performance

b590a25

Optimized 'format' hot path, for 10% improvement in simplest case

0576421

Fixed failing test

3945ad4

in line with projectfluent#67 (comment)

Merge branch 'implement_format' into compiling_message_context

6090673

spookylukey closed this Feb 2, 2019

spookylukey deleted the compiling_message_context branch March 3, 2019 01:20
