[spec] Allow impls to limit code point range #488

rossberg · 2017-05-29T12:50:46Z

As discussed on WebAssembly/design#1016, make it legal for implementations in environments that do not understand (all of) Unicode to only support smaller character subsets.

jfbastien

Fine with me, @sunfishcode approved in a prior discussion (and @lukewagner agreed later), but as I mentioned here I want to make sure this is discussed independently.

Specifically, I'd like to get feedback from @annevk, @domenic, and @tabatkins.

tabatkins · 2017-05-30T18:25:52Z

As long as it's a subset (such as ASCII) and not a different charset entirely (like Shift-JIS), yeah, no problem here.

domenic · 2017-05-30T21:12:34Z

I don't really understand the spec text for this restriction, but it seems like other people do, so maybe it's fine.

Reading the other threads, it seems like the actual interpretation is that implementations are free to reject incoming source bytes if certain bits in those bytes are set to certain values? E.g. an implementation is free to reject incoming source bytes if the most-significant-bit is set, effectively only allowing ASCII names?

It would be a lot clearer to me if things were stated that way, but as I said, it seems like others aren't having this comprehension problem, so maybe it's fine.

RyanLamansky · 2017-05-30T21:40:27Z

@tabatkins @domenic The terms "code point" and "common subsets" are clear enough to me that we're still talking about Unicode values, not binary bytes/bits. It might be helpful to be more explicit about this distinction, though.

domenic · 2017-05-30T21:44:18Z

How are we talking about Unicode values? Isn't this spec discussing implementation-specific limitations on the inputs, which are definitely bytes?

RyanLamansky · 2017-05-30T21:48:41Z

@domenic I had to look at the whole file; the section headings, which add more context to the changes, are not shown in the GitHub "Files changed" view.

domenic · 2017-05-30T21:52:16Z

Right, I guess I don't understand what the first change applies to, i.e. the "Syntactic Limits" heading.

rossberg · 2017-05-31T06:59:44Z

@domenic, it's described in terms of the abstract syntax, which defines names as sequences of Unicode code points. That makes it independent of the concrete input format (e.g. binary or text format).
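To illustrate the distinction discussed above, here is a minimal sketch (not taken from the spec or from any implementation) of a limit expressed on code points rather than on input bytes. The function names `name_within_limit` and `all_bytes_ascii` and the parameter `max_code_point` are hypothetical, and the sketch assumes the name arrives UTF-8 encoded, as in the binary format.

```python
def name_within_limit(raw: bytes, max_code_point: int = 0x7F) -> bool:
    """Check a UTF-8 encoded name against a code point limit.

    Hypothetical helper: the limit applies to the decoded Unicode
    code points (the abstract syntax), not to the encoded bytes.
    """
    name = raw.decode("utf-8")  # raises UnicodeDecodeError on malformed input
    return all(ord(ch) <= max_code_point for ch in name)


def all_bytes_ascii(raw: bytes) -> bool:
    """Byte-level view: reject any byte with the most significant bit set."""
    return all(b < 0x80 for b in raw)


ascii_name = "memory".encode("utf-8")
latin_name = "caf\u00e9".encode("utf-8")  # U+00E9 is encoded as two bytes

# For an ASCII-only limit, the byte check and the code point check agree.
print(name_within_limit(ascii_name), all_bytes_ascii(ascii_name))
print(name_within_limit(latin_name), all_bytes_ascii(latin_name))

# For a larger limit such as U+00FF, only the code point view is meaningful:
# the name is accepted even though its bytes have the top bit set.
print(name_within_limit(latin_name, max_code_point=0xFF))
```

For an ASCII-only limit the two views coincide, which matches the reading in terms of the most significant bit. For any larger subset they diverge, which is one reason to state the limit on code points.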

rossberg · 2017-06-06T11:05:16Z

Seems like there is approval and no objections, so I'll merge.

[spec] Allow impls to limit code point range

b4171e3

rossberg mentioned this pull request May 29, 2017

[spec] Implementation restrictions #483

Merged

jfbastien requested review from domenic and annevk May 29, 2017 19:23

jfbastien approved these changes May 29, 2017

rossberg merged commit 56ac21e into spec.limits Jun 6, 2017

rossberg deleted the spec.limits.jf branch June 6, 2017 11:05

rossberg added a commit that referenced this pull request Jun 6, 2017

[spec] Implementation restrictions (#483)

6d6648b

Includes [spec] Allow impls to limit code point range (#488).

dhil pushed a commit to dhil/webassembly-spec that referenced this pull request Jan 25, 2024

Merge pull request WebAssembly#488 from q82419/main

bc887cd

[test] Unify the error message of `"null structure reference"`.
