Discuss variable priorities and shadowing #310

mihnita · 2022-10-26T21:23:27Z

The PR #305 trying to document the hoisting decision introduced the wording that has effect on the shadowing resolution.

To resolve the value of a Variable, its Name is used to identify either a local variable, or a variable defined elsewhere.
If a local variable and an externally defined one use the same name, the local variable takes precedence.

That was not part of a discussion, and the decision is debatable.

mihnita · 2022-10-26T21:25:49Z

The PR discussion thread, for convenience:

@stasm stasm 8 days ago
This implies that name shadowing is possible. My current thinking is that it should be OK, and I see use-cases for it. However, I don't recall an explicit discussion on the subject. It might also make sense to start with a stricter design and disallow shadowing at first.

@eemeli eemeli 8 days ago
I rather think that local variables must be allowed to shadow externally provided named arguments.

For instance, I could well imagine translation systems with the following attributes:

Translators are able to define additional local variables for a message.
A developer may pass the same basket of arguments to multiple messages, not all of which use each of them.
If shadowing were not allowed, then a combination of the above could result in a runtime error which would be difficult to discover ahead of time.

@mihnita mihnita 23 seconds ago
I think that this part of the change (shadowing) is a bit of an overreach.
It is unrelated to the hoisting problem, and it was not commonly agreed.

In general I don't think it is a good idea to allow translators to override a developer decision.
So "Translators are able to define additional local variables for a message." is a cons in my book, not a pro.

And "A developer may pass the same basket of arguments to multiple messages, not all of which use each of them." does not look like something related to shadowing.

There is a reason many linters warn about shadowing.

"If shadowing were not allowed, then a combination of the above could result in a runtime error which would be difficult to discover ahead of time."

Not necessarily.
One can say that translators can't define variables (and that is a good thing).
It is an error, but the decision between runtime error and simply ignore the conflicting local variable is implementation dependent.
We should not use that as an argument for what the standard should do.

But TLDR:

I don't see valid use-cases for it. In fact the proposal we had used different sigils for local variables (macros at the time) and arguments.
I don't recall an explicit discussion on the subject.
It might also make sense to start with a stricter design and disallow shadowing at first.

mihnita · 2022-10-26T21:35:35Z

Adding to it a bit: in one of the discussions Eemeli introduced the idea that we can have a "map of variables" passed at parse time, in addition to the one we pass at formatting time.

For example:

pattern = '
    let $foo = {$count :number} // LOCAL VARIABLES
    {Hello {$foo} and {$bar}!}
'

mf = MessageFormat2.parse(pattern, { 'foo': 42, 'bar': 'success', 'baz': 'this works' } ) // VARIABLE MAP 1
result = mf.format({ 'foo': 101, $bar: 'fail', 'count': 12345 } ) // VARIABLE MAP 2

So when the messages uses $foo and $bar we need to decide not only local variables vs variables defined elsewhere, but also the priority between the various "elsewhere"

I remember that in a previous discussion we discussed having several "bags of arguments", and the simple answer was to merge the bags before passing them to message format.
So the decision would be with the developer what overrides what, without complicating the standard.

But anyway, if we allow for more bags, we need to decide the priority.

mihnita · 2023-06-15T22:01:31Z

To summarize my arguments against this functionality:

There is no useful functionality added by allowing variable overrides

This is probably already reasonably well captured in previous comments, so I will not add any details.

Adds complexity to the spec and implementations

For example we previously discussed the risk or circular dependencies in definitions:

let b = {$a ...}
let c = {$b ...}
let a = {$c ...}

The mitigation was to not do hoisting, voted for by all people in that thread:
(zbraniecki, macchiati, eemeli, stasm, mihnita)
#297

But with this change we reintroduce the risk of circular dependencies:

let a = {$foo ...}
let b = {$a ...}
let a = {$b ...}

It is not hoisting, the declaration of a in line 3 can see the declaration of b in line 2, which can see the declaration of a in line 1.
Unless the definition of b actually "sees" the definition of a in line 1. So that definition is no completely shadowed by line 3, as a "left over" is captured in the definition of b.

In other words, if I do let a = {42} let b = {$a} let a = {13} let c = {$a} {The values are {b} and {c} !}
What is the result?
It it is "The values are 13 and 13 !", this is hoisting-like, and allows circular dependencies.
And it if is "The values are 42 and 13 !", so the first definition is not "fully shadowed".
(behaves like macros in C :-)

So the spec would have to explain what happens in case 2, how to avoid circular definitions (if we shadow 100%).
And implementations will have to follow that spec.

That is adding complexity to the spec / implementations, for no good benefit
("no benefit" is a personal judgement, see item 1, we can debate it)

I'm fine to add complexity as a price for something useful (balancing price vs benefit).

And this PR does not seem to explain the behavior in the above cases.

People are used to it

If memory serves me, I think this was mentioned in the meeting 2 weeks ago.

If not, I apologize, this would be a straw-man, please ignore.

So, would people expect it, because it is common?
Yes and no.

a. Many linters warn about shadowing.

So it is not necessarily a good thing.

b. Ant properties are immutable

See https://ant.apache.org/manual/Tasks/property.html:
<property name="foo.dist" value="dist"/>

And I've seen huge ant files implementing complex functionality despite this "limitation"

So it is not unheard of, and it is not that limiting.

c. In Rust you let a = 42, and it is immutable by default.

To make it mutable you need to say let mut a = 42
And in code the review the first question will probably "why does this have to be mutable?"
So the same idea from item a.: this is not a good thing, needs to serve a purpose.
Even the keyword is let, like we have.
So Rust does not try to accommodate some expectation of mutability.

d. Even in JS let is not like what we do here

In JS let can be updated but not re-declared

let a = 43 // OK
a = 13 // OK
let = 101 // Error

But with this PR we allow re-declaration, so we are even more "lax" than JS
(because for us declaration is initialization, which is a good thing)

macchiati · 2023-06-15T22:46:28Z

For my part, I see no reason why variables need to be mutable (aka mut). It is not a particular imposition on the writer to use a slightly altered name. The exposition and implementation are both simpler.

…

On Thu, Jun 15, 2023 at 3:01 PM Mihai Nita ***@***.***> wrote: To summarize my arguments against this functionality: ------------------------------ 1. There is no useful functionality added by allowing variable overrides This is probably already reasonably well captured in previous comments, so I will not add any details. ------------------------------ 1. Adds complexity to the spec and implementations For example we previously discussed the risk or circular dependencies in definitions: let b = {$a ...} let c = {$b ...} let a = {$c ...} The mitigation was to not do hoisting, voted for by all people in that thread: (zbraniecki, macchiati, eemeli, stasm, mihnita) #297 <#297> But with this change we reintroduce the risk of circular dependencies: let a = {$foo ...} let b = {$a ...} let a = {$b ...} It is not hoisting, the declaration of a in line 3 can see the declaration of b in line 2, which can see the declaration of a in line 1. Unless the definition of b actually "sees" the definition of a in line 1. So that definition is no completely shadowed by line 3, as a "left over" is captured in the definition of b. In other words, if I do let a = {42} let b = {$a} let a = {13} let c = {$a} {The values are {b} and {c} !} What is the result? It it is "The values are 13 and 13 !", this is hoisting-like, and allows circular dependencies. And it if is "The values are 42 and 13 !", so the first definition is not "fully shadowed". (behaves like macros in C :-) So the spec would have to explain what happens in case 2, how to avoid circular definitions (if we shadow 100%). And implementations will have to follow that spec. That is adding complexity to the spec / implementations, for no good benefit ("no benefit" is a personal judgement, see item 1, we can debate it) I'm fine to add complexity as a price for something useful (balancing price vs benefit). And this PR does not seem to explain the behavior in the above cases. ------------------------------ 1. People are used to it If memory serves me, I think this was mentioned in the meeting 2 weeks ago. If not, I apologize, this would be a straw-man, please ignore. So, would people expect it, because it is common? Yes and no. a. Many linters warn about shadowing. So it is not necessarily a good thing. b. Ant properties are immutable See https://ant.apache.org/manual/Tasks/property.html: <property name="foo.dist" value="dist"/> And I've seen huge ant files implementing complex functionality despite this "limitation" So it is not unheard of, and it is not that limiting. c. In Rust you let a = 42, and it is immutable by default. To make it mutable you need to say let mut a = 42 And in code the review the first question will probably *"why does this have to be mutable?"* So the same idea from item a.: this is not a good thing, needs to serve a purpose. Even the keyword is let, like we have. So Rust does not try to accommodate some expectation of mutability. d. Even in JS let is not like what we do here In JS let can be updated but not re-declared let a = 43 // OK a = 13 // OK let = 101 // Error But with this PR we allow re-declaration, so we are even more "lax" than JS (because for us declaration is initialization, which is a good thing) — Reply to this email directly, view it on GitHub <#310 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACJLEMAP6Y5DFDFNW2LFTN3XLOA4RANCNFSM6AAAAAARPNDAZI> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

eemeli · 2023-06-16T07:34:16Z

From @mihnita: (note: example edited to fix syntax)
In other words, if I do
let $a = {42}
let $b = {$a}
let $a = {13}
let $c = {$a}
{The values are {$b} and {$c} !}
What is the result? It it is "The values are 13 and 13 !", this is hoisting-like, and allows circular dependencies. And it if is "The values are 42 and 13 !", so the first definition is not "fully shadowed". (behaves like macros in C :-)

So the spec would have to explain what happens in case 2, how to avoid circular definitions (if we shadow 100%). And implementations will have to follow that spec. [...]

And this PR does not seem to explain the behavior in the above cases.

Presuming that this is in reference to PR #381, that proposal currently includes this:

To resolve the value of a variable, its name is used to identify either a local variable, or a variable defined elsewhere.

If more than one declaration binds a value to the same name, or if an externally defined variable and a declaration use the same name, the resolved value of the most recent declaration preceding the variable is used.

Taking this one step at a time:

To format the pattern, we need to resolve the expression {$b}, which contains a variable $b.
1. To resolve the value of $b here, we find its most recent declaration preceding this variable on line 2.
2. The declaration on line 2 setting the value of $b has a right-hand side expression {$a}, which contains a variable $a.
3. To resolve the value of $a here, we find its most recent declaration preceding this variable on line 1.
4. The declaration on line 1 setting the value of $a has a right-hand side expression {42}, which contains the literal 42.
5. The value of this $a resolves to "42", and so the value of this $b also resolves to "42".
Next, we need to first resolve the expression {$c}, which contains a variable $c.
1. To resolve the value of $c here, we find its most recent declaration preceding this variable on line 4.
2. The declaration on line 4 setting the value of $c has a right-hand side expression {$a}, which contains a variable $a.
3. To resolve the value of $a here, we find its most recent declaration preceding this variable on line 3.
4. The declaration on line 3 setting the value of $a has a right-hand side expression {13}, which contains the literal 13.
5. The value of this $a resolves to "13", and so the value of this $c also resolves to "13".

Put together, the message resolves and formats the same as if it had been:

{The values are {42} and {13} !}

@mihnita, with reference to the above, would you be willing to acknowledge that the behaviour in these cases is explained, and does not include any circular references?

aphillips · 2023-06-16T18:01:28Z

@eemeli replied overnight, while I had the below to sleep on... but I think it's still worth sharing.

There are several reasons why a variable might want to be mutable. In fact, it might be useful not to think of them as "variables" so much as them being the "argument list" or "keys". I'll use the word variable below, but keep in mind that we are not a programming language. If anything we are closer to being a declarative language.

One reason to allow mutation is: the variable name may be referenced by the message pattern already. When the pattern is a simple message, altering the name of the variable in successive let statements and then in the expression is perhaps trivial. But when there are selectors there can be multiple expressions that reference the variable. Each such reference is probably a segment in the translation memory. Changing the name globally in the message is error prone and breaks TM leveraging. For example, here's the English (other languages will also have few, many, two, etc.):

let $foo = {$foo :transform}
match {$a :plural} {$b :plural}
when 0   0   {...{$foo}...}
when 0   one {...{$foo}...}
when 0   *   {...{$foo}...}
when one 0   {...{$foo}...}
when one one {...{$foo}...}
when one *   {...{$foo}...}
when *   0   {...{$foo}...}
when *   one {...{$foo}...}
when *   *   {...{$foo}...}

@stasm also called out that it makes for a bit of weirdness, in that externally defined variables might be overridden, but locally defined ones produce an error. If we make variables immutable, I would probably favor making it an error to override an externally supplied value (which might produce hard-to-debug errors if the argument list is dynamically generated or used across multiple messages--the latter of these is more common than one might think)

Also, we don't allow function nesting, so any transformation that requires multiple function calls results in multiple "waste values" since, as noted above, can't just "fix up" the original value. A reasonable use for reassignment is decoration or normalization of the original value. Stuff like:

let $foo = {$foo :text-transform transform=uppercase}
let $foo = {$foo :trim}
let $foo = {$foo :sanitize target=html}

... and I might very much want to do such processing in my message because (a) laboriously writing transforms into my expressions is messy and (b) if I need to nest transforms I have to write a custom function to do it (because no nesting). Yes, I can assign a new variable to each transform, but why do I have to?

I don't think anyone is proposing "forward definitions" or circular dependencies. If the message refers to an as-yet undefined value, that's an error.

One of @mihnita's questions was the answer to this riddle:

In other words, if I do let a = {42} let b = {$a} let a = {13} let c = {$a} {The values are {b} and {c} !}
What is the result?

I think it makes sense if the answer is:

And it if is "The values are 42 and 13 !", so the first definition is not "fully shadowed".

... because as noted at the start, we are not a programming language and these are not variables: they are keyed values. So $b was assigned the value of $a, not the reference to $a

The above arguments work best on primitives (string, number, etc.) and less well with values that are objects (person, currency value, inventory item, "Object Foo"), where each let assignment could require a deep copy of values on the right hand side.

I'll be interested to see how we land on this issue, as there is merit to making keyed values into constants. But I think there is reasonable utility to allowing reassignment of a key because I value utility and ease for message authors more than the internal troubles for MF implementers in this case.

mihnita · 2023-06-18T21:45:22Z

And many localization tools can leverage changes in placeholders already.
They leverage (and change the placeholder "on the fly") from "Hello %s" to "Hello %12s" to "Hello {0}"

But that's not the main point. We can just use another names.

the variable name may be referenced by the message pattern already

Then we can always add another intermediate name before.

For the first example:

let $foo = {$foo :transform}
match {$a :plural} {$b :plural}
  when 0   0   {...{$foo}...}
  ...
  when *   *   {...{$foo}...}

can be refactored without changing the messages proper (affected by leveraging):

let $foo_tmp = {$foo :transform1}
let $foo = {$foo_tmp :transform2}
match {$a :plural} {$b :plural}
  when 0   0   {...{$foo}...}
  ...
  when *   *   {...{$foo}...}

This one:

let $foo = {$foo :text-transform transform=uppercase}
let $foo = {$foo :trim}
let $foo = {$foo :sanitize target=html}
```
can similarly use different names, and it will not affect leveraging:

let $foo_tmp1 = {$foo :text-transform transform=uppercase}
let $foo_tmp2 = {$foo_tmp1 :trim}
let $foo = {$foo_tmp2 :sanitize target=html}


---

> so any transformation that requires multiple function calls results in multiple "waste values" since, as noted above
> ...
> And it if is "The values are 42 and 13 !", so the first definition is not "fully shadowed".
> ... because as noted at the start, we are not a programming language and these are not variables: they are keyed values.

I thing that between these 2 clarifications what we are "wasting" are not values, but value names.

If the "shadowed definition" is still accessible somehow (to give us 42 in the example) it means the whole expression still exists, without a name. So that is maybe just a minor optimization? Not the concern of a spec?

---

> because as noted at the start, we are not a programming language and these are not variables: they are keyed values.

I don't think that is obvious, or even 100% agreed.

Or, if it is for (some of) us, it is not clear for programmers, using programming languages all day long.
This looks like a programming language, but it does not behave like one.
And they use (this feature) rarely enough to not know or understand this is not the same.
The fact that we call them variables, but are not, does not help.

The example above is unclear unless you really read and really understand the spec (if already captured, and I am not sure).
For me the behavior was not clear.

Using different names makes 100% obvious without reading the spec.

---

Should I add here my comments from https://github.com/unicode-org/message-format-wg/pull/381?

> I don't think anyone is proposing "forward definitions" or circular dependencies.
> If the message refers to an as-yet undefined value, that's an error.

If the answer to my previous question was "13 ... 13", then it was a technically a circular dependency.

mihnita · 2023-06-18T21:59:57Z

Eemeli quotes the proposal to the spec:

If more than one declaration binds a value to the same name

I think that the meaning of "binds" is not defined.

Zibi used "binding" all the time, and I asked many times for a clarification of what that means.
And I also pointed out to existing "binding" techniques that are incompatible with the behavior he described.

TLDR: I don't think there is one universal definition of "bind" that I don't understand, and everyone else does.
So we can't just use that in a spec and assume it is understood.

There are difference between "static binding", "dynamic binding", "early binding", "late binding"

So we don't know what "bind" means here.

And we can completely bypass the need to define and understand it by just saying that the local "things" we declare & define (in one single step) are immutable.
And maybe we should not call them variables :-)

And I don't think that if we document what binding means in our context helps.
We force developers to read and understand that.
And if our meaning of "binding" is different from what they use day to day in their programming languages, we introduce "cognitive friction".

mihnita · 2023-06-18T22:03:46Z

A good read: https://courses.cs.vt.edu/~cs3304/Spring02/lectures/lect06.pdf

eemeli · 2023-06-19T08:17:10Z

@mihnita, I earlier asked the question:

@mihnita, with reference to the above, would you be willing to acknowledge that the behaviour in these cases is explained, and does not include any circular references?

As your answer to this appears to boil down to:

I think that the meaning of "binds" is not defined.
[...] And I don't think that if we document what binding means in our context helps.
We force developers to read and understand that.

I take from this that other than the use of the word "binds", you do acknowledge that the behaviour in the cases you specify is explained and that the spec does not introduce any circular references. I've now updated the PR to read:

To resolve the value of a variable, its name is used to identify either a local variable, or a variable defined elsewhere.

If the left-hand side variable of more than one declaration uses the same name, or if an externally defined variable and a declaration use the same name, the resolved value of the most recent declaration preceding the variable is used.

Do you have any remaining concerns regarding this PR?

To be honest, I think it's simpler to effectively say "you can reassign the value of a variable" rather than what we currently say, which is either "you cannot reassign the value of a variable" or "you can reassign the value of a variable, but only once, and only if it's originally an external variable", depending on how you interpret the current text.

If PR #381 is not accepted, I would ask that someone else propose spec text to resolve this ambiguity.

macchiati · 2023-06-20T15:06:01Z

There are several reasons why a variable might want to be mutable. In fact, it might be useful not to think of them as "variables" so much as them being the "argument list" or "keys". I'll use the word variable below, but keep in mind that we are not a programming language. If anything we are closer to being a declarative language.

The chief value of immutability is avoiding mistakes: where people reassign a variable without meaning to, or they use a variable without realizing that it has been changed.

It also prevents them from inadvertently messing up an input variable, which is easy to do because they are not declared.

let $foo = {$foo :text-transform transform=uppercase}
let $foo = {$foo :trim}
let $foo = {$foo :sanitize target=html}

If people have to do actions like this in more than a very small % of cases, then we have bigger problems in terms of usability.

See unicode-org#310

mihnita · 2023-06-26T18:18:59Z

Just as an interesting piece of history:
https://internals.rust-lang.org/t/history-of-shadowing-in-rust/6441

It tells us how (same level) shadowing ended up in Rust, and the fact that this was not some very useful feature, well though out. More of an accident, considered for removal, and in the end "nobody else really cared"

And several people in that thread seem to agree that shadowing is error prone.

See unicode-org#310

catamorphism · 2023-07-05T03:49:41Z

I filed #413 to specifically discuss the question of mutability / immutability.

eemeli · 2023-07-06T08:16:51Z

Two more reasons for mutability, specifically for allowing local values to override external ones:

This allows for writing messages that effectively "strongly type" their arguments. For example:

let $count = {$count :number}
let $date = {$date :datetime dateStyle=long}
match {$count}
when 1 {You received one message on {$date}}
when * {You received {$count} messages on {$date}}

Here, we've made it explicit for both the translator and developer formatting this message what the types of $count and $date ought to be, and we've ensured that we get the expected output even if the formatting function is called with

{ count: '1234', date: 1688630222849 }

Not needing to create new variable names means that within the message body there's less potential for making mistakes. Continuing with the same message as above:

If we did not allow for local values to override external ones, the developer would end up writing the message as something like

let $_count = {$count :number}
let $_date = {$date :datetime dateStyle=long}
match {$_count}
when 1 {You received one message on {$_date}}
when * {You received {$count} messages on {$_date}}

and if we used a different sigil for local variables (say, &), we'd get a message like

let &count = {$count :number}
let &date = {$date :datetime dateStyle=long}
match {&count}
when 1 {You received one message on {&date}}
when * {You received {$count} messages on {&date}}

With the first approach, we end up with $count, $_count, $date, and $_date all being valid within the message body. With the second approach, $count, &count, $date, and &date are all valid in the message body.

Now, did you notice that my examples above use $count in the second variant's placeholder where they should use $_count or &count? Please be honest when replying. That's an error which no linter would notice, and it's an error which absolutely would be made by developers and translators.

If instead we allow local variables to override external ones, only $count and $date are valid in the message body, and this category of error disappears.

catamorphism · 2023-07-20T18:55:56Z

Two more reasons for mutability, specifically for allowing local values to override external ones:

This allows for writing messages that effectively "strongly type" their arguments. For example:
[snip]

Not needing to create new variable names means that within the message body there's less potential for making mistakes.

I see (1) and (2) as good arguments for allowing local variables to override external variables, but not for mutability in general or for local variables overriding other local variables. External variables are already "special" in the sense that they don't need to be declared with a let, so I don't see a problem with making them more "special" in that they can be overridden, though that's a matter of opinion.

(I can also imagine ways to achieve the same goals as (1) and (2) without allowing any overriding. For example, a type system -- but that would be a major change and I assume it's been discussed in the past, so I can understand the reasoning behind (1) and/or (2) as workarounds for untypedness.)

aphillips · 2023-07-22T20:59:05Z

@catamorphism

A potential problem with allowing external variables to be overridden is that it allows the creation of unintentional bugs or transient runtime failures. Because the external arguments are undeclared, it is easy for a message to declare a variable that turns out to conflict with a given argument array, producing unintended results.

If we make all variables immutable and external and local vars share a namespace, passing an argument that shares a name with a local declaration can cause a message to fail.

{ "arg1": "37"}
...
let $arg1 = {|42| :number maxFractionDigits=2}  // error

If we make all variables immutable but external and local vars do not share a namespace, this problem goes away. However, a local variable cannot be used to augment or annotate an external variable.

{ "arg1": "42" }
...
let #arg1 = {$arg1 :number maxFractionDigits=2}
{Now I have to change {$arg} to {#arg}...}

If variables are mutable and namespaces are shared, it's easy to write a message that never fails but does produce unintended or unexpected results (from the caller's point of view).

{"arg1": "10000"}
...
let $arg1 = {42}
{This always says {$arg1} == 42}

If variables are mutable but namespaces are not shared, its easy for developers or translators to reference the wrong one:

{"arg1": "10000"}
...
let #arg1 = {42 :number maxFractionDigits=2}
{This always says {$arg1} == 10000 because it should say {#arg1}}

So none of these solutions is "perfect". They are all balancing some tradeoff.

Elsewhere on this thread I have tended to favor mutability to allow multiple transforms and the modification of formatting directives (including on a language-specific basis) without modifying the pattern parts of a message. The number of text segments affected can become quite large when there are multiple match expressions, so I favor the utility a bit.

Note that the argument list is not always explicitly composed by the caller. I've worked with integrating message formatting into a number of templating and data languages in which the argument map is precomputed (and often contains hundreds of fields: think of an item detail page on Amazon). It's okay to reference one of these values, but there is always the danger that you unintentionally smash the value (silently, which is bad) or conflict with an immutable value (runtime failures that are harder to diagnose).

@macchiati noted:

let $foo = {$foo :text-transform transform=uppercase}
let $foo = {$foo :trim}
let $foo = {$foo :sanitize target=html}
If people have to do actions like this in more than a very small % of cases, then we have bigger problems in terms of usability.

See my previous discussion about frequency. The percentage of total strings for nearly all of our features is pretty small. Multiple transforms are not that uncommon for us because we don't have subroutines or function nesting. I think a better question is: which feature is more important: multiple transforms or preventing accidental masking?

macchiati · 2023-07-23T19:02:22Z

Nice outline of the options, and food for thought. After reading it, I propose: You can have at most one "let X" for any X. - Specifically, you can override an input variable with a formatted version of that input variable, once. - Otherwise you can't override any variables. OK let $count = {$count :number sig=3} let $date = {$date :datetime skeleton=yMMMd} match {$count} {$date} when one {You have {$count} book on {$date}.} when * {You have {$count} books on {$date}.} FAIL (input variable set twice) let $count = {$count :number sig=3} let $count = {$count :number sig=4} ... FAIL (local variable, being reset) let $count = {|42|} let $count = {$count :number sig=4} ...

…

On Sat, Jul 22, 2023 at 1:59 PM Addison Phillips ***@***.***> wrote: @catamorphism <https://github.com/catamorphism> A potential problem with allowing external variables to be overridden is that it allows the creation of unintentional bugs or transient runtime failures. Because the external arguments are undeclared, it is easy for a message to declare a variable that turns out to conflict with a given argument array, producing unintended results. If we make *all* variables immutable and external and local vars share a namespace, passing an argument that shares a name with a local declaration can cause a message to fail. { "arg1": "37"} ... let $arg1 = {|42| :number maxFractionDigits=2} // error If we make *all* variables immutable but external and local vars do not share a namespace, this problem goes away. However, a local variable cannot be used to augment or annotate an external variable. { "arg1": "42" } ... let #arg1 = {$arg1 :number maxFractionDigits=2} {Now I have to change {$arg} to {#arg}...} If variables are mutable and namespaces are shared, it's easy to write a message that never fails but does produce unintended or unexpected results (from the caller's point of view). {"arg1": "10000"} ... let $arg1 = {42} {This always says {$arg1} == 42} If variables are mutable but namespaces are not shared, its easy for developers or translators to reference the wrong one: {"arg1": "10000"} ... let #arg1 = {42 :number maxFractionDigits=2} {This always says {$arg1} == 10000 because it should say {#arg1}} So none of these solutions is "perfect". They are all balancing some tradeoff. Elsewhere on this thread I have tended to favor mutability to allow multiple transforms and the modification of formatting directives (including on a language-specific basis) without modifying the pattern parts of a message. The number of text segments affected can become quite large when there are multiple match expressions, so I favor the utility a bit. Note that the argument list is not always explicitly composed by the caller. I've worked with integrating message formatting into a number of templating and data languages in which the argument map is precomputed (and often contains hundreds of fields: think of an item detail page on Amazon). It's okay to reference one of these values, but there is always the danger that you unintentionally smash the value (silently, which is bad) or conflict with an immutable value (runtime failures that are harder to diagnose). @macchiati <https://github.com/macchiati> noted: let $foo = {$foo :text-transform transform=uppercase} let $foo = {$foo :trim} let $foo = {$foo :sanitize target=html} If people have to do actions like this in more than a very small % of cases, then we have bigger problems in terms of usability. See my previous discussion about frequency. The percentage of total strings for nearly all of our features is pretty small. Multiple transforms are not that uncommon for us because we don't have subroutines or function nesting. I think a better question is: which feature is more important: multiple transforms or preventing accidental masking? — Reply to this email directly, view it on GitHub <#310 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACJLEMFB6AAQL2B4BMPNEXTXRQ5KJANCNFSM6AAAAAARPNDAZI> . You are receiving this because you were mentioned.Message ID: ***@***.***>

stasm · 2023-07-23T21:08:26Z

@macchiati Do you mean that it would only ever be possible for local variables to shadow input variables? Or is this fact accidental in your examples? Can I also have let $month = {$date :datetime skeleton=MMMM} there?

macchiati · 2023-07-23T23:41:54Z

You could definitely do that, as long as there is no let $month = {something} before that statement. What I've focussed on is the chief example that people cite for wanting to reassign the meaning of a variable.

…

On Sun, Jul 23, 2023 at 2:08 PM Stanisław Małolepszy < ***@***.***> wrote: @macchiati <https://github.com/macchiati> Do you mean that it would *only ever* be possible for local variables to shadow input variables? Or is this fact accidental in your examples? Can I also have let $month = {$date :datetime skeleton=MMMM} there? — Reply to this email directly, view it on GitHub <#310 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACJLEMHBMO3AEAJZMM762Q3XRWHFNANCNFSM6AAAAAARPNDAZI> . You are receiving this because you were mentioned.Message ID: ***@***.***>

aphillips · 2023-07-24T15:16:38Z

@macchiati I think that's too hard to explain. I would prefer to find a solution that lets us syntactically check violations rather than relying on (invisible) functionality that has to be carefully explained to developers and translators.

Perhaps introduce new keywords:

// immutable local variable
let $foo = {|42|} 

// augment foo
modify {$foo :number maxFractionDigits=0}
// multiple times?
modify {$foo :number groupingUsed=false}

// this is an error because foo is already assigned
let $foo = {|57|}

catamorphism · 2023-07-25T00:00:55Z

I haven't had a chance to read through all the comments yet, but note that we need to be careful to distinguish between mutabilty and name shadowing. Mutability makes it impractical to implement lazy evaluation with memoization (see #413 (comment)). Name shadowing does not.

macchiati · 2023-07-25T00:16:43Z

@macchiati I think that's too hard to explain.

I think it is actually very easy to explain.

It is an error to have the line

let X = …

if you have

let X = …

above it in the message.

eemeli · 2023-07-25T06:12:15Z

I'd be ok with allowing local variables to shadow external ones, but not other local variables, i.e. what we effectively now have in the spec and what @macchiati proposes. That would address my main concerns, though my preference would be to allow local variable shadowing as well.

I also note that no-one has yet answered the question I ask in my previous comment regarding separate name spaces.

stasm · 2023-07-25T07:59:44Z

I also note that no-one has yet answered the question I ask in #310 (comment) regarding separate name spaces.

I actually did notice — really! — but only because I was suspecting a set-up :)

But I see your point. Not having to change all variable references in the message body would be a welcome property of the solution that we'll end up designing.

It sounds like we have 3 requirements (not everyone agrees on all three, though):

Be able to re-annotate input variables under the same name without having to rename them in the message body.
Allow static analysis to detect mistakes when you try to reference an undefined local variable (Allow to statically analyze messages to detect undefined local variables #403).
Be able to re-annotate variables multiple times, to address the lack of nesting.

I think we can address all three with namespacing of local variable and by introducing new syntax for re-annotation, as @aphillips suggests:

// Syntax TBD.

// $count is an input variable
modify {$count :number minFrac=2}

// $$count is a local variable; no conflict thanks to namespacing
let $$count = {|42|} 

// re-annotating local variables is fine
modify {$$count :number minFrac=2} 

// re-annotating multiple times is fine
modify {$names :person.each declension=dative name=first}
modify {$names :list style=long type=conjunction}

// local variables are useful when we need to create things with new meaning
let $$hostName = {$host :person firstName=long}
let $$guestName = {$guest :person firstName=long}
let $$guestsOther = {$guestCount :number offset=1}

stasm · 2023-07-25T09:16:36Z

A few more thoughts:

In my previous comment, I originally used the annotate keyword. I've edited the comment to use modify, in line with @aphillips's comment. Let's focus on the requirements, and bikeshed the syntax later :) Maybe it's going to be set and let, annotate and local, modify and const, or something else completely.

The third requirement on my list says:

Be able to re-annotate variables multiple times, to address the lack of nesting.

I realize that this is still phrased in terms of a solution, rather then the use-case. Let me try again:

Enable complex use-cases which require multiple transformations (annotations) without the need to give a new name to each intermedite step. Naming is hard; remembering to use the new names is error-prone.

In the teleconference yesterday I said namespacing and shadowing were orthogonal and I suggested to make a decision about namespecing first. However, I now see that they're likely tightly-coupled, unfortunately. That's beause namespacing without a discussion about shadowing directly contradicts the first requirement above: Be able to re-annotate input variables under the same name without having to rename them in the message body. We need to pair with something like the modify keyword solution.

eemeli · 2023-11-29T10:49:26Z

@aphillips I'm pretty sure this has been resolved by our current approach using input and local, yes?

aphillips · 2023-11-29T13:40:28Z

Yes. The Seville consensus was to use input/local and make values immutable, eliminating this issue. These solutions have reached the spec, so closing. Reopen if you (the reader) disagree.

mihnita mentioned this issue Oct 26, 2022

Add variable resolution #305

Merged

mihnita added the blocker-candidate The submitter thinks this might be a block for the next release label Nov 3, 2022

aphillips added blocker Blocks the release editorial Issue is non-normative and removed blocker-candidate The submitter thinks this might be a block for the next release labels Jan 9, 2023

catamorphism mentioned this issue May 22, 2023

Clarify that variables declarations may override previous ones #381

Closed

catamorphism added a commit to catamorphism/message-format-wg that referenced this issue Jun 24, 2023

Require variable names to be globally unique

6fd8a47

See unicode-org#310

catamorphism mentioned this issue Jun 24, 2023

Require variable names to be globally unique #402

Closed

catamorphism added a commit to catamorphism/message-format-wg that referenced this issue Jun 24, 2023

Require variable names to be globally unique

2bbefef

See unicode-org#310

catamorphism added a commit to catamorphism/message-format-wg that referenced this issue Jun 24, 2023

Require variable names to be globally unique

bc6be29

See unicode-org#310

catamorphism added a commit to catamorphism/message-format-wg that referenced this issue Jun 24, 2023

Require variable names to be globally unique

77f2e6c

See unicode-org#310

stasm mentioned this issue Jun 26, 2023

Allow to statically analyze messages to detect undefined local variables #403

Closed

catamorphism added a commit to catamorphism/message-format-wg that referenced this issue Jul 5, 2023

Require variable names to be globally unique

8c2aecb

See unicode-org#310

catamorphism mentioned this issue Jul 5, 2023

Should variables be mutable? #413

Closed

aphillips mentioned this issue Sep 4, 2023

Design document for variable mutability and namespacing #469

Merged

eemeli added the resolve-candidate This issue appears to have been answered or resolved, and may be closed soon. label Nov 29, 2023

aphillips closed this as completed Nov 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discuss variable priorities and shadowing #310

Discuss variable priorities and shadowing #310

mihnita commented Oct 26, 2022

mihnita commented Oct 26, 2022

mihnita commented Oct 26, 2022

mihnita commented Jun 15, 2023

macchiati commented Jun 15, 2023 via email

eemeli commented Jun 16, 2023

aphillips commented Jun 16, 2023

mihnita commented Jun 18, 2023

mihnita commented Jun 18, 2023

mihnita commented Jun 18, 2023

eemeli commented Jun 19, 2023

macchiati commented Jun 20, 2023

mihnita commented Jun 26, 2023

catamorphism commented Jul 5, 2023

eemeli commented Jul 6, 2023 •

edited

Loading

catamorphism commented Jul 20, 2023

aphillips commented Jul 22, 2023

macchiati commented Jul 23, 2023 via email

stasm commented Jul 23, 2023

macchiati commented Jul 23, 2023 via email

aphillips commented Jul 24, 2023

catamorphism commented Jul 25, 2023 •

edited

Loading

macchiati commented Jul 25, 2023

eemeli commented Jul 25, 2023

stasm commented Jul 25, 2023 •

edited

Loading

stasm commented Jul 25, 2023 •

edited

Loading

eemeli commented Nov 29, 2023

aphillips commented Nov 29, 2023

Discuss variable priorities and shadowing #310

Discuss variable priorities and shadowing #310

Comments

mihnita commented Oct 26, 2022

mihnita commented Oct 26, 2022

mihnita commented Oct 26, 2022

mihnita commented Jun 15, 2023

macchiati commented Jun 15, 2023 via email

eemeli commented Jun 16, 2023

aphillips commented Jun 16, 2023

mihnita commented Jun 18, 2023

mihnita commented Jun 18, 2023

mihnita commented Jun 18, 2023

eemeli commented Jun 19, 2023

macchiati commented Jun 20, 2023

mihnita commented Jun 26, 2023

catamorphism commented Jul 5, 2023

eemeli commented Jul 6, 2023 • edited Loading

catamorphism commented Jul 20, 2023

aphillips commented Jul 22, 2023

macchiati commented Jul 23, 2023 via email

stasm commented Jul 23, 2023

macchiati commented Jul 23, 2023 via email

aphillips commented Jul 24, 2023

catamorphism commented Jul 25, 2023 • edited Loading

macchiati commented Jul 25, 2023

eemeli commented Jul 25, 2023

stasm commented Jul 25, 2023 • edited Loading

stasm commented Jul 25, 2023 • edited Loading

eemeli commented Nov 29, 2023

aphillips commented Nov 29, 2023

eemeli commented Jul 6, 2023 •

edited

Loading

catamorphism commented Jul 25, 2023 •

edited

Loading

stasm commented Jul 25, 2023 •

edited

Loading

stasm commented Jul 25, 2023 •

edited

Loading