Long-term performance: AssetGraph #787

matanlurey · 2017-12-24T05:11:28Z

I've uploaded the e2e_example/.../asset_graph.json (formatted).

It is about 1.5mb, for what basically is a "Hello World" with DDC. I'm not sure what a typical angular project or something a bit more substantial might look like - maybe not a huge deal?

A couple things I saw looking at the output:

We compile a lot of packages that aren't used at runtime. It doesn't seem like a huge deal until you realize we analyze it, create unlinked summaries, create linked summaries, create DDC'd JavaScript, create .errors. outputs, create source maps etc.

Maybe we need a way of excluding tooling-only packages manually (or with heuristics)?

Maybe the asset graph should be split into different parts per package, considering that only the nodes in the current package are actually likely to change?

I might misunderstand, though.

As @jakemac53 mentions originally in Evaluate other storage formats for dependency graph. #41, maybe we just need an output format that is better for fast-incremental writes. I don't have much expertise here.

The text was updated successfully, but these errors were encountered:

jakemac53 · 2017-12-24T05:16:41Z

We do have nodes in the asset graph for all those things, but we don't actually compute modules/summaries/ddc for anything that isn't imported by a real entrypoint - or at least not if you set them as "optional" which should be done for those actions as well as the module action.

matanlurey · 2017-12-24T05:19:42Z

Ah I see. That's good, though it still bloats the graph.

I do see some files in generated I didn't expect to see:

... though I guess this all must be based on test/**.dart having CLI-based tests.

jakemac53 · 2017-12-24T05:23:49Z

Ya you could try creating a build.yaml which overrides the ddc_bootstrap_builder to only run on web entrypoints (that one is what actually causes things to get built).

I am not actually 100% sure we expose that builder separately though from the config.

jakemac53 · 2017-12-24T05:24:08Z

(using generate_for)

jakemac53 · 2017-12-24T05:37:42Z

we could also try to make it smart about not running on entrypoints that are clearly not web - but that is tricky

matanlurey · 2017-12-24T05:46:36Z

I mean I know we essentially do the same waste on Bazel to an extent, but I think (?) the difference here is the asset graph is more separated than our implementation. Anyway, haven't seen any obvious performance problems yet, though here is a build on my Macbook Pro:

[INFO] BuildDefinition: Reading cached asset graph completed, took 178ms
[INFO] BuildDefinition: Building new asset graph completed, took 471ms

I'm not 100% sure how to read this yet, but 200ms for a read doesn't seem bad considering it's only on a cold start. We should start to get more realistic numbers with stuff like the angular components gallery though.

jakemac53 · 2017-12-24T05:51:06Z

I'm not 100% sure how to read this yet, but 200ms for a read doesn't seem bad considering it's only on a cold start. We should start to get more realistic numbers with stuff like the angular components gallery though.

Yes - we haven't seen big enough issues with it yet to bother trying to optimize it. As soon as we do it should be relatively straightforward to replace the format as its broken out into separate classes that do the serialization/deserialization.

Some sort of format that allows incremental updating would be ideal - we could even consider using a local database or something. Pretty much everything is on the table.

jakemac53 · 2017-12-24T05:56:18Z

[INFO] BuildDefinition: Reading cached asset graph completed, took 178ms
[INFO] BuildDefinition: Building new asset graph completed, took 471ms

Were there other logs between that? It should only build a new asset graph if it invalidated the previous one, which it should give you a message about.

matanlurey · 2017-12-24T06:05:27Z

[INFO] ensureBuildScript: Generating build script completed, took 340ms
[WARNING] BuildDefinition: Throwing away cached asset graph because the build actions have changed. This could happen as a result of adding a new dependency, or if you are using a build script which changes the build structure based on command line flags or other configuration.
[INFO] BuildDefinition: Reading cached asset graph completed, took 178ms
[INFO] BuildDefinition: Building new asset graph completed, took 471ms
[INFO] BuildDefinition: Checking for unexpected pre-existing outputs. completed, took 1ms
[INFO] Build: Running build completed, took 19326ms
[INFO] Build: Caching finalized dependency graph completed, took 84ms
[INFO] Build: Succeeded after 19525ms with 984 outputs

jakemac53 · 2017-12-24T06:24:03Z

Ah looks like the logs are a bit confusing in that case because I think the log about the build actions changing happens while we are reading in the graph, and log the finished line. Probably not a huge issue but it is a bit confusing.

natebosch · 2017-12-27T17:13:59Z

you could try creating a build.yaml which overrides the ddc_bootstrap_builder to only run on web entrypoints

build_web_compilers|ddc_bootstrap already only runs on ["web/**", "test/**.browser_test.dart"]. Is the problem that there are more *.browser_test.dart than will actually get run?

matanlurey · 2018-03-08T04:36:14Z

I wonder how this has changed with optional builders. @natebosch?

jakemac53 · 2018-03-08T15:17:44Z

Optional builders still have nodes in the graph - they just might not be built.

jakemac53 · 2018-03-08T15:18:20Z

(fwiw, ddc/summaries/modules have always been optional, or at least for a long time)

matanlurey · 2018-03-08T15:19:27Z

Ah my mistake, thanks.

natebosch · 2018-05-31T20:59:15Z

One particular thing we should look at is the memory usage of the AssetGraph. It looks like the VM representation can end up being way bigger than the json representation which is a hint that there is some canonicalization happening in our Json representation (we don't repeat strings) that needs to be happening for our AssetGraph. One thing that is likely happening is we're holding on to multiple copies of duplicate AssetIds (which have duplicate String references).

davidmorgan · 2025-03-22T10:48:54Z

Closing in favour of #3811

I'll be looking at the asset graph next: precisely to deduplicate anything that can be deduplicated and remove anything that can be removed.

matanlurey added the package:build_runner label Dec 24, 2017

jakemac53 added P1 A high priority bug; for example, a single project is unusable or has many test failures type-enhancement A request for a change that isn't a bug labels Jan 8, 2018

matanlurey added the type-performance label Mar 29, 2018

jakemac53 mentioned this issue Jul 7, 2020

Size of assets-graph.json is too large #2747

Closed

davidmorgan self-assigned this Jan 29, 2025

davidmorgan closed this as completed Mar 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Long-term performance: AssetGraph #787

Long-term performance: AssetGraph #787

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

natebosch commented Dec 27, 2017

matanlurey commented Mar 8, 2018

jakemac53 commented Mar 8, 2018

jakemac53 commented Mar 8, 2018

matanlurey commented Mar 8, 2018

natebosch commented May 31, 2018

davidmorgan commented Mar 22, 2025

Long-term performance: AssetGraph #787

Long-term performance: AssetGraph #787

Comments

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

matanlurey commented Dec 24, 2017

jakemac53 commented Dec 24, 2017

natebosch commented Dec 27, 2017

matanlurey commented Mar 8, 2018

jakemac53 commented Mar 8, 2018

jakemac53 commented Mar 8, 2018

matanlurey commented Mar 8, 2018

natebosch commented May 31, 2018

davidmorgan commented Mar 22, 2025