Remove protobufjs in favor of protons-runtime #60
Conversation
@talentlessguy you might gain more savings switching to protons and protons-runtime instead? It would dedupe with
Note: I think in general I'd prefer to switch to
thanks for the suggestion
FWIW, last time I ran the benchmark suite, protons was just over 20% faster than protobuf-es when serializing/deserializing - https://github.com/ipfs/protons/tree/main/packages/protons-benchmark#usage These kinds of performance characteristics are very important to high-traffic deployments like Lodestar, so it's unlikely to be switched away from elsewhere.
do you mean protobufjs, and not protons?
I wonder how hard it would be to replace protobuf-es with protons; I'll give it a shot right now.
I guess the only thing is that protons compiles protobuf definitions to TypeScript, and this is still a JS project.
@achingbrain is the benchmark encode or decode? We only encode in this repo.
src/codec.js
```diff
-      Data: content.byteLength > 0 ? content : undefined,
-      filesize: content.byteLength,
+      Data: content.byteLength > 0 ? content : EMPTY_BUFFER,
+      filesize: BigInt(content.byteLength),
```
Does the dag-pb encoder support bigint?
I'm not sure about dag-pb; is it relevant to this PR?
```diff
-      Data: content.byteLength > 0 ? content : undefined,
-      filesize: content.byteLength,
+      Data: content.byteLength > 0 ? content : EMPTY_BUFFER,
+      filesize: content.length === 0 ? Object.assign(0n, { __forceEncode: true }) : BigInt(content.length),
```
Why do we need `__forceEncode` here?
Adding these `undefined` fields and setting `__forceEncode` seems to be repeated often - shouldn't it be moved into `encodePB` so we get fewer changes in other places?
This is because protons-runtime treats `0n` as the field's default value and therefore doesn't include it in the encoded protobuf.
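A minimal sketch of that behaviour (illustrative only: `isDefault` and `shouldEncode` are hypothetical helpers, not the actual protons-runtime internals). A proto3-style writer skips a `uint64` field whose value equals the default `0n`; `Object.assign(0n, { __forceEncode: true })` works because `Object.assign` boxes the bigint primitive, giving a wrapper object that can carry the marker while still converting back to `0n`:

```javascript
// Illustrative sketch, not the real protons-runtime writer.
const isDefault = (value) => BigInt(value) === 0n

// Object.assign boxes the bigint primitive, so the wrapper object can
// carry an extra property while still converting back to 0n.
const forced = Object.assign(0n, { __forceEncode: true })

// A proto3-style encoder drops default values unless forced.
const shouldEncode = (value) =>
  !isDefault(value) || value.__forceEncode === true

console.log(shouldEncode(0n))     // false: default value, omitted from the wire
console.log(shouldEncode(5n))     // true: non-default, encoded
console.log(shouldEncode(forced)) // true: default value, but forced
```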
Protons is a proto3 implementation, though it tries its best with proto2. The big difference is that there's no `required` modifier any more, because the authors considered it harmful.
If you want a value to always be present in the protobuf in proto3, you should mark it `optional` and set a value. This is compatible on the wire with `required` in proto2.
E.g. these will result in the same bytes:

```proto
syntax = "proto2";

message Derp {
  required int64 Value = 1;
}
```

```proto
syntax = "proto3";

message Derp {
  optional int64 Value = 1;
}
```
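To see why the two definitions produce identical bytes, here is a hand-rolled sketch of the protobuf wire format for that single field (`encodeVarint` and `encodeValueField` are illustrative helpers, not part of any library): field number 1 with wire type 0 (varint) always yields the tag byte `0x08`, regardless of which syntax declared the field.

```javascript
// Sketch of protobuf varint encoding (illustrative helpers, not a library API).
const encodeVarint = (n) => {
  const out = []
  let v = BigInt(n)
  do {
    let byte = Number(v & 0x7fn) // low 7 bits
    v >>= 7n
    if (v > 0n) byte |= 0x80 // continuation bit
    out.push(byte)
  } while (v > 0n)
  return out
}

// Field number 1, wire type 0 (varint): tag byte is (1 << 3) | 0 = 0x08.
// Both the proto2 `required` and proto3 `optional` Value fields encode this way.
const encodeValueField = (value) => [0x08, ...encodeVarint(value)]

console.log(encodeValueField(5n))   // [ 8, 5 ]
console.log(encodeValueField(300n)) // [ 8, 172, 2 ]
```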
You can see the proto3 definition ipfs-unixfs uses here, and the bytes generated are compatible with older proto2 parsers.
There was some discussion about this on the libp2p specs repo a while back which goes over the various pros and cons and how to maintain backwards compatibility - libp2p/specs#465
So what's the best solution? Migrating js-unixfs to use proto3, or just keeping it the way it is?
In general, there is a lot of change in this PR that makes me nervous about merging it and retaining compatibility.
Would more comprehensive tests make you more comfortable?
The tests only change numbers to bigints; the rest didn't change. That's the only breaking change, actually. I made sure not to alter anything else.
@achingbrain I'm not sure why, but protons with proto3 is unable to properly encode 0-ish values:

```js
/**
 * @param {Uint8Array} content
 * @returns {UnixFS.ByteView<UnixFS.Raw>}
 */
export const encodeRaw = content =>
  encodePB(
    {
      Type: NodeType.Raw,
      Data: content,
      filesize: BigInt(content.length),
      // @ts-ignore
      blocksizes: [],
      fanout: 0n,
      mode: 0,
      hashType: 0n,
    },
    []
  )
```
Produces:

```
1) format neunaces
     raw with no content:
   AssertionError: expected Uint8Array[ 10, 0 ] to deeply equal Uint8Array[ 10, 4, 8, 0, 24, 0 ]
   + expected - actual

   -{"0":10,"1":0}
   +{"0":10,"1":4,"2":8,"3":0,"4":24,"5":0}
```
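For reference, the two byte strings in that assertion can be decoded by hand with the standard protobuf tag rule (field number in the high bits, wire type in the low three). `readTag` below is an illustrative helper, and the field-name comments assume the usual dag-pb/unixfs schemas:

```javascript
// Decode a protobuf tag byte: field number in bits 3+, wire type in bits 0-2.
// (Illustrative helper; field names in the comments assume the dag-pb/unixfs schemas.)
const readTag = (byte) => ({ fieldNumber: byte >> 3, wireType: byte & 0x07 })

// expected [10, 4, 8, 0, 24, 0]:
console.log(readTag(10)) // { fieldNumber: 1, wireType: 2 } length-delimited, length 4
console.log(readTag(8))  // { fieldNumber: 1, wireType: 0 } varint 0 (Type)
console.log(readTag(24)) // { fieldNumber: 3, wireType: 0 } varint 0 (filesize)

// actual [10, 0]: the same outer field but with an empty (length 0) payload,
// i.e. the inner zero-valued fields were dropped as defaults.
```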
This used to work with proto2:

```js
export const encodeRaw = content =>
  encodePB(
    {
      Type: NodeType.Raw,
      Data: content.byteLength > 0 ? content : EMPTY_BUFFER,
      filesize: content.length === 0 ? Object.assign(0n, { __forceEncode: true }) : BigInt(content.length),
      // @ts-ignore
      blocksizes: EMPTY,
      fanout: 0n,
    },
    []
  )
```
Protobuf has the concept of default values. For example, if a field's type is `uint64` and no value for it is present in the protobuf binary, on deserialization it will be set to the default value of `0n`, unless the field is marked `optional`, in which case it will be set to `undefined`.
If a field is not marked `optional` (i.e. it is singular, in proto3 nomenclature) and is set to the default value, it will be omitted from the protobuf binary.
Consequently, values for `optional` fields will be written into the protobuf even if they are equal to the default value; otherwise it would later be impossible to know whether the field was explicitly set to the default value or was intended to be omitted from the message.
Above you are setting values for `optional` fields, which is why there are more bytes in the serialized form of the message than you are expecting.
> Above you are setting values for optional fields which is why there are more bytes in the serialized form of the message than you are expecting.

Not sure if I follow: the "expected" diff has these bytes: `[ 10, 4, 8, 0, 24, 0 ]`, while the actual has only two, so that's the exact opposite of what you said. Should I change something in the proto declaration to work around this?
This closes #31
Switches to protons-runtime
All tests pass now