Add metadata & frontmatter #15

eemeli · 2023-11-30T10:32:22Z

Closes #14
CC @zbraniecki and @flodolo, if you've any comments

This adds syntax for @-prefixed key-value metadata, and for --- as a separator between resource-level comments & metadata (aka the "frontmatter") and the resource body. Together, they look like this (using ini highlighting, which mostly works):

```ini
@locale en-US
---

one = A message with no properties

@version 3
@since 2023-11-30
two = A message with some properties

# Freeform comments must come before properties
@param $foobar - An input argument
                 with a multiline value
three = Some {$foobar} message

# Metadata also attaches to section-heads
@deprecated
[section]

four = Foo
```

Like comments, metadata attaches to the "next thing" in the syntax. To allow for resource-level attachment, a --- frontmatter separator is included in the syntax. This allows using the same syntax and concepts for resources as for sections and entries. As detailed in #14, it's also a common pattern used in other formats. Many other formats with frontmatter do not themselves have something like @metadata, and so need to pick a separately defined format for their frontmatter content (often YAML). YAML itself, by contrast, uses %directives in its frontmatter.
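To make the role of the separator concrete, here is a minimal Python sketch of splitting a resource into frontmatter and body. The function name `split_frontmatter` is hypothetical, and the sketch assumes that the separator is a line consisting only of `---` and that only comment, metadata, continuation, or blank lines may come before it; neither assumption is spelled out in this PR.

```python
def split_frontmatter(source: str) -> tuple[list[str], list[str]]:
    """Split resource text into (frontmatter_lines, body_lines).

    Assumptions (not taken from the spec): the separator is a line that
    contains only `---`, and the frontmatter may hold only comments (`#`),
    metadata (`@`), indented continuation lines, and blank lines.
    """
    lines = source.splitlines()
    for index, line in enumerate(lines):
        if line.rstrip() == "---":
            # Everything before the separator is resource-level frontmatter.
            return lines[:index], lines[index + 1:]
        if line and not line.startswith(("#", "@", " ", "\t")):
            # A message or section head came first, so there is no frontmatter.
            break
    return [], lines


frontmatter, body = split_frontmatter("@locale en-US\n---\n\none = A message\n")
print(frontmatter)
print(body)
```

A resource without a separator is treated as all body, so comments and metadata at its top attach to the first entry or section rather than to the resource.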

Metadata values use the same value construct as message entries, which means that they may be multiline if indented, and their inner syntax (beyond the keyword) will need to be defined separately. The @ prefix is rather intentionally chosen to allow matching Javadoc/JSDoc/TSDoc syntax, which is relatively well known.
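As an illustration of the multiline rule, the following Python sketch reads one metadata entry together with its indented continuation lines. The name `parse_metadata` is hypothetical, and two details are assumptions rather than part of this proposal: the key is taken to end at the first space, and the leading whitespace of continuation lines is stripped before the lines are joined with `\n`.

```python
def parse_metadata(lines: list[str], start: int) -> tuple[str, str, int]:
    """Parse the metadata entry that begins at lines[start].

    Returns (key, value, next_index), where next_index is the first line
    after the entry. Whitespace handling here is an assumption of this
    sketch, not a rule taken from the syntax definition.
    """
    first = lines[start]
    if not first.startswith("@"):
        raise ValueError("a metadata line must start with '@'")
    key, _, value = first[1:].partition(" ")
    parts = [value.strip()]
    index = start + 1
    # Lines starting with a space or a tab continue the current value.
    while index < len(lines) and lines[index][:1] in (" ", "\t"):
        parts.append(lines[index].strip())
        index += 1
    return key, "\n".join(part for part in parts if part), index


entry = [
    "@param $foobar - An input argument",
    "                 with a multiline value",
    "three = Some {$foobar} message",
]
print(parse_metadata(entry, 0))
```

The value is returned as an opaque string, since its inner syntax beyond the keyword is left to be defined per field.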

It's likely that not all consumers of a resource will care about all or any of the metadata. For example, while something like @version could be important to a system tracking which messages need re-translation, it probably would not have any effect during the message's formatting. On the other hand, a resource-level @locale could end up significant for all consumers.

At this level, the syntax does not differentiate between metadata fields depending on e.g. their relevance to formatting; that should be done separately. One purely mechanical way to allow for some formatting-relevant metadata would be to only support it in the frontmatter. Another alternative would be to introduce a second sigil besides the @ to mark such fields. Or we could define an explicit list of keywords with a formatting impact.

Comments or empty lines are not allowed between metadata lines and the line they're attaching to. This is intentional, and meant to ensure that they stay together.
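A linter could check this attachment rule roughly as follows. This is only a sketch: `find_detached_metadata` is a hypothetical helper, and it assumes that a run of metadata lines may be followed by more metadata, an entry, a section head, or the `---` separator, but not by a comment, an empty line, or the end of the file.

```python
def find_detached_metadata(lines: list[str]) -> list[int]:
    """Return 0-based indices of metadata entries not attached to anything.

    An entry counts as detached when the line after it (and after any of
    its continuation lines) is empty, a comment, or missing entirely.
    """
    detached = []
    index = 0
    while index < len(lines):
        if not lines[index].startswith("@"):
            index += 1
            continue
        start = index
        index += 1
        # Skip over indented continuation lines of a multiline value.
        while index < len(lines) and lines[index][:1] in (" ", "\t"):
            index += 1
        following = lines[index] if index < len(lines) else ""
        if following.strip() == "" or following.startswith("#"):
            detached.append(start)
    return detached


print(find_detached_metadata(["@version 3", "", "two = A message"]))
```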

The id rule needs to get narrowed a bit as a part of this change, as it can't start with ---.

flodolo · 2023-11-30T14:19:25Z

The proposal makes sense to me.

My initial reaction was that putting metadata within the comment would be a better approach, since we likely need to display both to users (e.g. in Pontoon). The main complication is figuring out how to manage multiline values. But, on second thoughts, I can see a world where we interpolate metadata via scripts, and having them separate from comments makes things a lot easier.

eemeli · 2023-11-30T16:57:43Z

My thought was that in some contexts (like formatting), the metadata could be effectively ignored just as comments are.

In that sort of a parser, a comment can be skipped by first recognising it from its first # character, and then skipping to the next \n. Metadata is recognised from its first @, after which the parser can skip to the next \n, and check if the next character is a space or a tab. If so, it's a metadata continuation line, and the parser can again skip to the next \n and repeat until it gets something different.

So it should be just as easy, and this way metadata lines don't need a double sigil like # @ at their start.
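The skipping logic described above can be sketched in Python as follows. The function name `skip_comments_and_metadata` is hypothetical, and the sketch assumes that `pos` always points at the start of a line.

```python
def skip_comments_and_metadata(source: str, pos: int = 0) -> int:
    """Return the offset of the first line that is neither a comment nor metadata.

    Assumes `pos` is at the start of a line. Nothing is parsed; lines are
    only recognised by their first character and then skipped.
    """
    def next_line(at: int) -> int:
        # Offset just past the next "\n", or the end of the input.
        newline = source.find("\n", at)
        return len(source) if newline == -1 else newline + 1

    while pos < len(source):
        if source[pos] == "#":
            pos = next_line(pos)
        elif source[pos] == "@":
            pos = next_line(pos)
            # A leading space or tab marks a metadata continuation line.
            while pos < len(source) and source[pos] in " \t":
                pos = next_line(pos)
        else:
            break
    return pos


text = (
    "# Freeform comments must come before properties\n"
    "@param $foobar - An input argument\n"
    "                 with a multiline value\n"
    "three = Some {$foobar} message\n"
)
print(text[skip_comments_and_metadata(text):])
```

Both branches need only the first character of a line, which is why the metadata case costs one extra check compared with comments.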

nordzilla · 2023-12-04T21:56:07Z

I definitely like and support the idea of having separate tags for comments and attributes.

The distinction in the syntax feels much more readable to me overall.

I also personally don't like it when comments have semantic meaning that is tied to the code/config, with perhaps the exception of ``` code blocks ``` in Rust documentation comments being runnable as tests.

Attributes are a common practice in most languages that I'm familiar with, and I think this solution feels natural.

zbraniecki · 2023-12-04T22:11:53Z

I'm ambivalent on this design.

I'm used to thinking of attributes as part of comments, maybe due to @jsdoc, maybe due to how long it took to settle on Semantic Comments for Fluent, but I can see the argument @nordzilla made.

As a result, I feel comfortable supporting this proposal as is.

I looked at the syntax from the error recovery heuristics perspective and it seems like the general level remains the same and multiline attributes do not induce new vectors.

eemeli · 2023-12-13T09:40:29Z

Merging, as this seems like a good next step and lets us start working on actual metadata fields.

Add metadata & frontmatter

48fc00f

eemeli requested a review from stasm November 30, 2023 10:32

eemeli mentioned this pull request Dec 1, 2023

Add data model as TypeScript definitions #16

Merged

mathjazz mentioned this pull request Dec 5, 2023

Avoid translation of strings (and parts of strings) that should not be translated mozilla/pontoon#3043

Open

eemeli merged commit 9e9cdde into main Dec 13, 2023

eemeli deleted the metadata branch December 13, 2023 09:40

eemeli mentioned this pull request Dec 13, 2023

Update README with syntax example #17

Merged

eemeli mentioned this pull request Jan 15, 2024

Allow message-level attributes to enable gender/etc-aware workflows unicode-org/message-format-wg#595

Closed
