VariableOrderAccumulator #940


Draft · wants to merge 4 commits into base: breaking

Conversation

@mhauru (Member) commented May 29, 2025

Removes the order field of Metadata in favour of having an OrderedDict{VarName,Int} in the same accumulator as num_produce (renaming NumProduceAccumulator to VariableOrderAccumulator in the process). Also adds some == methods we were previously missing.

This is currently passing tests, except anything related to JET. I think JET freaks out because the OrderedDict within the new accumulator has an abstract key type. I think the abstract key type is fine as long as the value type is concrete, at least once we remove VariableOrderAccumulator from the set of default accumulators and only use it when doing ParticleGibbs. I'm thus tempted not to fix the JET issues and instead move this whole accumulator from DPPL to the part of Turing.jl that interfaces with AdvancedPS. Not sure how to handle merging this PR in that case, though.
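For reference, a rough sketch of the shape the new accumulator might take (the field names here are assumptions; the PR's actual definition may differ):

using OrderedCollections: OrderedDict
using AbstractPPL: VarName

# Hypothetical sketch, not the PR's exact definition.
struct VariableOrderAccumulator
    num_produce::Int
    # Abstract key type (VarName is not concrete), but concrete value type.
    order::OrderedDict{VarName,Int}
end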

Comment on lines +169 to +171
function Base.:(==)(vi1::VarInfo, vi2::VarInfo)
return (vi1.metadata == vi2.metadata && vi1.accs == vi2.accs)
end
mhauru (Member Author):

In making this PR I learned that the default implementation of == for structs is, in effect,

function Base.:(==)(vi1::VarInfo, vi2::VarInfo)
    return (vi1.metadata === vi2.metadata && vi1.accs === vi2.accs)
end

i.e. all the fields are compared with === even when calling ==. That was causing trouble with some tests that did == comparisons of SimpleVarInfos. So note that before this PR e.g. VarInfo() != VarInfo(), whereas now VarInfo() == VarInfo().
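To illustrate the fallback behaviour, a minimal standalone sketch (Wrapper is a made-up type, not part of DynamicPPL):

# Wrapper is hypothetical, used only to demonstrate the fallback.
struct Wrapper
    d::Dict{Symbol,Int}
end

a = Wrapper(Dict(:x => 1))
b = Wrapper(Dict(:x => 1))

a == b       # false: the generic fallback for == is ===, and the two
             # Dict fields are distinct mutable objects
a.d == b.d   # true: Dict defines its own structural ==

Base.:(==)(w1::Wrapper, w2::Wrapper) = w1.d == w2.d
a == b       # true, once a field-wise == is defined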

Contributor:

Benchmark Report for Commit 0b12781

Computer Information

Julia Version 1.11.5
Commit 760b2e5b739 (2025-04-14 06:53 UTC)
Build Info:
  Official https://julialang.org/ release
Platform Info:
  OS: Linux (x86_64-linux-gnu)
  CPU: 4 × AMD EPYC 7763 64-Core Processor
  WORD_SIZE: 64
  LLVM: libLLVM-16.0.6 (ORCJIT, znver3)
Threads: 1 default, 0 interactive, 1 GC (on 4 virtual cores)

Benchmark Results

|                 Model | Dimension |  AD Backend |      VarInfo Type | Linked | Eval Time / Ref Time | AD Time / Eval Time |
|-----------------------|-----------|-------------|-------------------|--------|----------------------|---------------------|
| Simple assume observe |         1 | forwarddiff |             typed |  false |                108.2 |                 1.0 |
|           Smorgasbord |       201 | forwarddiff |             typed |  false |               3138.4 |                23.0 |
|           Smorgasbord |       201 | forwarddiff | simple_namedtuple |   true |               1892.5 |                22.5 |
|           Smorgasbord |       201 | forwarddiff |           untyped |   true |               3852.3 |                21.2 |
|           Smorgasbord |       201 | forwarddiff |       simple_dict |   true |               7388.1 |                21.1 |
|           Smorgasbord |       201 | reversediff |             typed |   true |               4489.2 |                 9.6 |
|           Smorgasbord |       201 |    mooncake |             typed |   true |               3577.6 |                10.3 |
|    Loop univariate 1k |      1000 |    mooncake |             typed |   true |              31079.0 |                10.0 |
|       Multivariate 1k |      1000 |    mooncake |             typed |   true |               1131.9 |                 8.0 |
|   Loop univariate 10k |     10000 |    mooncake |             typed |   true |             353565.8 |                11.8 |
|      Multivariate 10k |     10000 |    mooncake |             typed |   true |               8945.4 |                 9.3 |
|               Dynamic |        10 |    mooncake |             typed |   true |                299.3 |                13.8 |
|              Submodel |         1 |    mooncake |             typed |   true |                111.8 |                 6.0 |
|                   LDA |        12 | reversediff |             typed |   true |               1479.9 |                 1.8 |

@@ -1808,13 +1800,12 @@ function BangBang.push!!(vi::VarInfo, vn::VarName, r, dist::Distribution)
[1:length(val)],
val,
[dist],
[get_num_produce(vi)],
mhauru (Member Author):

This is a change in behaviour: previously, calling push!! automatically set the order for a variable. Now the order is set only if the push!! takes place within tilde_assume!!. Options for this are:

  1. Say that it's the caller's responsibility to call setorder!! after push!!. This could be fine because only ParticleGibbs cares about order.
  2. Add an extra hook for accumulators that gets called on every push!! call, so that they can adjust their state accordingly (roughly sketched below).

If this is only relevant for VariableOrderAccumulator then I'd lean towards 1. If it comes up with other accumulators too then 2. might be warranted.

Similar considerations apply to at least push!, merge, and subset, which after this PR might result in out-of-sync VariableOrderAccumulators.
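For concreteness, option 2 might look roughly like the following; every name here is hypothetical, and map_accumulators!! stands in for whatever mechanism applies a function to each accumulator:

# Hypothetical hook; none of these names are existing DPPL API.
# Default: most accumulators ignore push!!.
on_push!!(acc, vn) = acc

# VariableOrderAccumulator records the current num_produce for the new variable.
function on_push!!(acc::VariableOrderAccumulator, vn)
    acc.order[vn] = acc.num_produce
    return acc
end

# push!! would then invoke the hook on every accumulator after inserting vn,
# along the lines of:
#   vi = insert_variable!!(vi, vn, r, dist)
#   vi = map_accumulators!!(acc -> on_push!!(acc, vn), vi)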

penelopeysm (Member):

> If this is only relevant for VariableOrderAccumulator then I'd lean towards 1.

I think it's PG's responsibility to call setorder correctly, rather than DPPL's, so I'd agree.

> Similar considerations apply to at least push!, merge, and subset

I still think it should be handled in PG, not here. I assume we could write functions like

function pg_push!!(...)
    vi = push!!(...)
    return setorder!!(...)
end

and make sure to always use that in the PG code?
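Fleshing that sketch out slightly, with hypothetical argument names (push!! and get_num_produce appear elsewhere in this PR; the exact setorder!! signature is an assumption):

# Hypothetical wrapper living in Turing.jl's AdvancedPS glue code;
# argument names and the setorder!! signature are illustrative.
function pg_push!!(vi, vn, r, dist)
    vi = push!!(vi, vn, r, dist)
    # push!! no longer records the order, so PG sets it itself.
    return setorder!!(vi, vn, get_num_produce(vi))
end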

mhauru (Member Author):

I'm happy with that, as long as it doesn't turn out that this is a common need for accumulators. One other instance comes to mind: currently, if you have a PointwiseLogDensityAccumulator in your varinfo and you subset or merge, the pointwise log densities don't get subsetted/merged, and you end up with an accumulator that tracks different variables from the varinfo. This is inconsequential because the use of PointwiseLogDensityAccumulator is confined to the one function that needs it.

I'm happy to make PG deal with this, but let's keep our eyes open in case this comes up with other accumulators.

penelopeysm (Member):

> PointwiseLogDensityAccumulator in your varinfo and you subset or merge

Ah, I see -- this would have been true in the past as well, with PointwiseLogDensityContext tracking different things from the subsetted varinfo, right?

mhauru (Member Author):

Yep. I don't think PLDAccumulator by itself is a good enough argument for making these subset and merge functions, but it did make me wonder whether this is a more common pattern with accumulators than we would at first assume. It's easy to leave them out now and add them later if needed, though.

@mhauru mhauru requested a review from penelopeysm May 29, 2025 16:00
@mhauru (Member Author) commented May 29, 2025

Benchmark times indicate a horrendous loss of type stability. Will investigate, probably tomorrow.

@penelopeysm (Member) commented

> Not sure how to handle merging this PR in that case though.

I had similar problems with other PRs. How about this?

  1. Make sure we're happy with the code, then drop it from the default accumulators and release a new minor version of DPPL. This will break upstream PG.
  2. Fix PG to work with it, and release a new version of Turing.
  3. Find code that can be moved from DPPL, and move it to Turing.

@mhauru (Member Author) commented May 30, 2025

Is there a particular reason to first drop it from default accumulators and then move it to Turing.jl, rather than doing both in one go?

Also, regardless of what we do, I would develop the corresponding Turing.jl release in parallel, to avoid having to make a lot of patch DPPL releases when we realise we are missing something. I've started that work in TuringLang/Turing.jl#2550, but not yet for VariableOrderAccumulator.

@penelopeysm (Member) commented May 30, 2025

Because it's annoyingly difficult to make Turing CI run with an unreleased version of DPPL, short of committing a test/Manifest.toml. There's the new [sources] thing that lets you point to unreleased versions, but it's 1.11 only, so the 1.10 tests will still need a Manifest. But I suppose if you're willing to run tests locally, that's fine (and maybe now that the tests are faster it's less unpalatable -- I've always hated running tests locally for various reasons, the time being one of them and fiddling with imports and such being another).
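For reference, a [sources] entry of the kind mentioned above would look something like this in the test environment's Project.toml (the branch name is illustrative):

# Hypothetical entry; repo URL is real, branch name is illustrative.
[sources]
DynamicPPL = {url = "https://github.com/TuringLang/DynamicPPL.jl", rev = "breaking"}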

(I don't think patch releases are really problematic, but there is always the possibility of having to make multiple minor releases to fix bugs, so I see the point)

@mhauru (Member Author) commented May 30, 2025

The performance problem turned out not to be type stability, but rather that every call to unflatten (which happens with every call to logdensity) resulted in a call to deepcopy(::OrderedDict) in VariableOrderAccumulator, and those, it seems, are really slow. I've replaced OrderedDict with Dict (didn't really need the ordering anyway) and started using copy rather than deepcopy; let's see what that does to the benchmarks. (Seems like it makes them crash...)
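A quick way to see the gap between the two, assuming BenchmarkTools is available (the dict size and keys here are purely illustrative):

using BenchmarkTools

d = Dict(Symbol(:x, i) => i for i in 1:100)

@btime copy($d)      # shallow copy: allocates one new Dict, shares keys/values
@btime deepcopy($d)  # recursively copies the whole object graph; much slower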

Two thoughts:

  • We should probably go over the codebase and replace a lot of uses of deepcopy with copy, because deepcopy is bad practice.
  • VariableOrderAccumulator would be another use case for VarNameTuple or some such data structure.
