add opcode definitions section #237

MikeHolman · 2015-06-29T16:07:58Z

No description provided.

jfbastien · 2015-06-29T17:03:40Z

BinaryEncoding.md

+  * the generic section header
+  * a table containing, for each opcode-space, a standardized string literal
+    type name (where index defines its type), offset (within the section),
+    sorted by offset, followed by


Could you make these sub-bullets?

What do you mean by "index defines its type"?

I mean that wherever we need to reference a type (e.g. in function definitions), we don't want to write "int32"; we would rather write 0 (if, for example, the int32 opcodes were first in this list).
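
For illustration, a minimal sketch of the index-as-type idea. The table contents and their order, and the names `TYPE_NAMES`, `type_index`, `type_name`, and `signature`, are assumptions made up for this example; the PR does not specify them:

```python
# Hypothetical opcode-space table: a type's position in the list is its index.
# The entries and their order are assumptions for this sketch.
TYPE_NAMES = ["int32", "int64", "float32", "float64"]


def type_index(name):
    """Return the index used to reference a type elsewhere in the module."""
    return TYPE_NAMES.index(name)


def type_name(index):
    """Map an index back to the standardized type name."""
    return TYPE_NAMES[index]


# A function definition references types by index instead of by string.
signature = {
    "params": [type_index("int32"), type_index("float64")],
    "result": type_index("int32"),
}
```

With int32 first in the table, the signature stores 0 wherever int32 is meant, and a decoder recovers the name through the table.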

jfbastien · 2015-06-29T17:07:50Z

What does it mean to have multiple sections with functions in each? Can one section's function call a function in another section?

Or do we just have one code section for now?

How do I access different data sections?

It looks like we're close to a container format... This is related to #74 about using ELF.

The definition is also beginning to look like BNF!

MikeHolman · 2015-06-29T17:42:40Z

> What does it mean to have multiple sections with functions in each? Can one section's function call a function in another section? Or do we just have one code section for now?

I don't see a logical reason to have more than one code section right now, so I think limiting to one should be ok.

> How do I access different data sections?

I guess there are a couple of ways we could go about it. If the different data section types are all singletons (e.g. you can only have a single import section), then you can access a section directly, and the decoder will know where to look when you ask for import[0]. Otherwise you can indirect through the section list to find the section you want and access your data from there (not quite as efficient, though).
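
A rough sketch of the two access paths, assuming a hypothetical section table of (type, offset) pairs sorted by offset. The section names, offsets, and helper names here are invented for illustration and are not part of the proposal:

```python
# Hypothetical section table: (type, offset) pairs, sorted by offset.
SECTION_TABLE = [("import", 16), ("code", 64), ("data", 256), ("data", 512)]


def index_singletons(table):
    """Map each section type that occurs exactly once to its offset.

    Built once at decode time, this gives direct access to singleton sections.
    """
    counts = {}
    for section_type, _ in table:
        counts[section_type] = counts.get(section_type, 0) + 1
    return {t: off for t, off in table if counts[t] == 1}


def nth_section_offset(table, section_type, n):
    """Walk the section list and return the offset of the n-th section of a type."""
    seen = 0
    for t, off in table:
        if t == section_type:
            if seen == n:
                return off
            seen += 1
    raise IndexError(f"no section {section_type}[{n}]")


singletons = index_singletons(SECTION_TABLE)
import_offset = singletons["import"]  # direct lookup for a singleton type
second_data = nth_section_offset(SECTION_TABLE, "data", 1)  # indirect lookup
```

The direct path is a single dictionary lookup, while the indirect path scans the section list, which is the efficiency difference mentioned above.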

> It looks like we're close to a container format... This is related to #74 about using ELF.

I don't really know ELF, but the conversation made it sound like the format will not map well for us, so I'd like to move forward with something else as a hedge.

sunfishcode · 2015-06-29T18:27:49Z

ELF isn't yet ruled out. These various tables could just map to special sections/segments in ELF. But I don't think we need to worry about that now. Let's design what we want first, and figure out whether ELF makes sense once we have that.

jfbastien · 2015-06-29T21:40:30Z

BinaryEncoding.md

-  * a table (sorted by offset) containing, for each section, its type and offset (within the module), followed by
-  * a sequence of sections.
+* A module contains (in this order):
+  * A header


Define header.

You caught me. I don't know what the headers contain. At the very least, the module header contains the magic number, but beyond that I don't have anything in particular decided. Maybe some things like whether the heap is 32- or 64-bit, the source language (for ABI), and the entry point. This will need to be figured out, but based on the level of detail in the rest of our design docs I'm not sure how much detail to go into here.

Maybe I should just mention some things like this as "ideas" for what a header would contain?

Yeah that sounds good. I like what you're adding overall, so I'll step back, take this improvement, and we can iterate later :-)

jfbastien · 2015-06-29T21:45:25Z

Maybe I'm going into too many details?

MikeHolman · 2015-06-29T22:31:27Z

@jfbastien Maybe a bit with null-terminated UTF-8 (which is what I had in mind, but I thought I was already bordering on too verbose), but you are right that "header" and "type" deserved some clarification.

jfbastien · 2015-06-30T17:58:30Z

lgtm

sunfishcode · 2015-06-30T18:14:31Z

BinaryEncoding.md

+* A module contains (in this order):
+  - A header, containing:
+    + The [magic number](https://en.wikipedia.org/wiki/Magic_number_%28programming%29)
+    + Other data TBD (possibly entrypoint, memory bitness, source language, etc.)


What is memory bitness?

At an initial glance, source language seems like something we'd specifically try to avoid including in the main header, because it suggests special magical per-source-language semantics.

> What is memory bitness?

I mean whether your linear memory has a 64-bit or 32-bit address space (i.e. whether the ptr type is int32 or int64). Maybe it's not necessary here, but it was just an idea for something we might want. I think we might need some version info for the module format as well. We may never need to break compat, but I can imagine a scenario where we eventually want to make format changes, and having a byte to allow for that would be useful.
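
To make the version-byte idea concrete, here is a minimal sketch of a header that is just a magic number followed by one version byte. The magic value `b"WASM"`, the 4-byte width, and the function names are placeholders assumed for this example; nothing in this PR fixes them:

```python
import struct

# Placeholder magic number; the thread does not decide on an actual value.
MAGIC = b"WASM"
# Assumed layout: 4-byte magic followed by a 1-byte format version.
HEADER = struct.Struct("<4sB")


def encode_header(version):
    """Pack the magic number and a one-byte format version."""
    return HEADER.pack(MAGIC, version)


def decode_header(data):
    """Check the magic number and return the format version."""
    if len(data) < HEADER.size:
        raise ValueError("truncated header")
    magic, version = HEADER.unpack_from(data)
    if magic != MAGIC:
        raise ValueError("bad magic number")
    return version
```

A decoder could branch on the returned version if the format ever needs an incompatible change.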

I'll just say other data TBD for now and remove the rest

titzer · 2015-06-30

In AstSemantics.md, we've just been assuming that one can index the linear memory with either 32-bit or 64-bit offsets. In the v8 native prototype, there are different bytecode numbers depending on whether the memory offset operand is an Int32 or an Int64.

jfbastien · 2015-07-01T03:10:42Z

I think this is good to go, we can iterate on top.

MikeHolman added 3 commits June 29, 2015 09:07

add opcode definitions section

2ebd97b

added type id to section

adad31c

combined tables

f6625e1

jfbastien reviewed Jun 29, 2015

some cleanup

f638520

jfbastien reviewed Jun 29, 2015

cleanup and clarifications

ad4a73e

sunfishcode reviewed Jun 30, 2015

MikeHolman added 2 commits June 30, 2015 13:32

tiny update

1abb706

made code sections have 64 bit offsets

90f8acd

jfbastien added a commit that referenced this pull request Jul 1, 2015

Merge pull request #237 from WebAssembly/definition-section

5f700c1

add opcode definitions section

jfbastien merged commit 5f700c1 into master Jul 1, 2015

jfbastien deleted the definition-section branch July 1, 2015 03:10

rossberg mentioned this pull request May 3, 2016

Simplify br_if by removing its value operand. #681

Closed
