Reinstate log-likelihood transforms #4521

brandonwillard · 2021-03-09T22:47:42Z

This PR reinstates the general log-likelihood transform framework.

I'm not happy with the current approach, so I'll probably make quite a few changes.

OriolAbril · 2021-03-09T22:49:44Z

I take we better wait for this before adding pointwise log likelihood conversion in #4489 ?

brandonwillard · 2021-03-09T23:00:56Z

I take we better wait for this before adding pointwise log likelihood conversion in #4489 ?

Not necessarily. The interface for log-likelihood computations shouldn't change much/at all.

ricardoV94 · 2021-03-10T09:09:26Z

I get this error when sampling from a model with Uniforms

with pm.Model() as m:
    x = pm.Uniform('x', 0, 1)
    y = pm.Uniform('y', 0, x, observed=[0.4, 0.5, 0.7])
    
with m:
    trace = pm.sample()

Traceback (most recent call last):
  File "/home/ricardo/Documents/Projects/pymc3-venv/lib/python3.8/site-packages/IPython/core/interactiveshell.py", line 3418, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-4-14e90d4b3669>", line 2, in <module>
    trace = pm.sample()
  File "/home/ricardo/Documents/Projects/pymc3/pymc3/sampling.py", line 429, in sample
    check_start_vals(model.test_point, model)
  File "/home/ricardo/Documents/Projects/pymc3/pymc3/util.py", line 210, in check_start_vals
    raise KeyError(
KeyError: 'Some start parameters do not appear in the model!\nValid keys are: x, y, but x_interval was supplied'

brandonwillard · 2021-03-11T05:44:51Z

@ricardoV94, I pushed an update that should fix that issue.

More testing is needed, and I need to figure out the best approach to syncing test values, because, right now, changing the test value of a variable does not change the corresponding transformed variable's test value (e.g. changing m.x.tag.test_value in the example above will not affect m.x.tag.value_var.tag.test_value).

N.B. I have no intention of literally "syncing" the corresponding *.tag.test_values; that would be a terrible idea. Instead, the Model object could provide some means of updating that information for corresponding untransformed/transformed terms.

In general, we need to remove PyMC3's use of *.tag.test_value and institute a distinct set of initial values. That is probably best accomplished through the Model interface, and, although *.tag.test_values may be set to the initial values, *.tag.test_value should not be used within PyMC3; otherwise, we'll never be able to wean the codebase off of its costly and problematic reliance on the test value debugging mechanism.

ricardoV94 · 2021-03-11T05:56:17Z

Sounds like we need an entire method to generate / control initial values.

Before we were using the mode/ mean/ median when provided by the distribution classes as well as the update_start_vals function to deal with transformations.

Should we rethink this approach?

Unrelated to this, are we able to get transformed random samples automatically or is the transform only appearing in measure space?

brandonwillard · 2021-03-11T06:18:49Z

Before we were using the mode/ mean/ median when provided by the distribution classes as well as the update_start_vals function to deal with transformations.

We can continue to do that, but in the context of these new initial values, and not by assigning values to *.tag.test_value directly.

Unrelated to this, are we able to get transformed random samples automatically or is the transform only appearing in measure space?

The current implementation automatically creates transformed variables when they're added to the Model object. This is how it works in v3, but, unlike v3, we don't create new random variables or deterministics for the transformed variables.

Now that we distinguish between random variables that are actually random variables (i.e. can be sampled) and variables that are used as inputs to log-likelihood graphs but correspond to random variables (i.e. in poor mathematical notation P(Y == y), where Y is the former and y is the latter), it doesn't make sense to include them anywhere in the Model object except Model.vars—where all the other log-likelihood input variables are.

Since Model.vars returns the symbolic log-likelihood inputs parameters in v4, and those are the variables we use to create log-likelihood graphs, by changing Model.vars so that it returns the transformed log-likelihood inputs (when applicable/available, of course), we'll end up producing and using the transformed log-likelihoods throughout most of the codebase.

ricardoV94 · 2021-03-11T06:26:08Z

pymc3/model.py

            if transform is not None:
-                self.deterministics.append(rv_var)
+                value_var.tag.transform = transform
+                value_var.name = f"{rv_var.name}_{transform.name}"


Any reason to drop the dunder at the end of the transformed variable name?

No, no reason. Why was it there in the first place?

Preceeded me. My guess would be to avoid name collisions on the user side.

ricardoV94 · 2021-03-11T10:31:37Z

It would be nice to have a helper method to sample transformed variables as well. e.g., SMC starts by sampling from the prior_predictive and converting the samples to untransformed domain to use as the starting point.

V4 makes it easier to go from sample <- -> logp, and this PR should fix the API for logp <- -> transformed_logp. In addition, I think we should also provide a well integrated API to go from sample <- -> transformed_logp (for a serious lack of better terminology). Otherwise I am afraid we will fall back to more fragile solutions behind bugs like #4484 that fail to consider dynamic constraints on the transformations.

ricardoV94 · 2021-03-11T10:35:27Z

I don't know if you intended to deal with this only later, but the current code is only getting samples for the transformed variables during sampling.

with pm.Model() as m:
    x = pm.Uniform('x', 0, 1)

with m:
    trace = pm.sample(compute_convergence_checks=False)

trace.varnames
# ['x_interval']

brandonwillard · 2021-03-11T17:41:57Z

I don't know if you intended to deal with this only later, but the current code is only getting samples for the transformed variables during sampling.

Yeah, I'll do that next. It seems like we'll need a better way to track these transformed variables, though, because the whole rv_var.tag.value_var approach isn't great.

Plus, it really won't work well when/if we need to keep track of compiled forward/backward transform graphs. I'm talking specifically about aesara.function-compiled graphs that we would use to transform/untransform values (e.g. tests/initial values, samples values, etc.)

twiecki · 2021-03-11T17:51:58Z

While we over-rely on test-values I just want to add that they are nice for debugging a model.

ricardoV94 · 2021-03-11T18:23:56Z

Yeah, I'll do that next. It seems like we'll need a better way to track these transformed variables, though, because the whole rv_var.tag.value_var approach isn't great.

Plus, it really won't work well when/if we need to keep track of compiled forward/backward transform graphs. I'm talking specifically about aesara.function-compiled graphs that we would use to transform/untransform values (e.g. tests/initial values, samples values, etc.)

The previous approach wasn't great either, looking for missing variables and checking if the names corresponded to the transformed / untransformed strings.

brandonwillard · 2021-03-11T18:57:30Z

While we over-rely on test-values I just want to add that they are nice for debugging a model.

We definitely won't get rid of the ability to use test values; we simply won't rely on them within our codebase. We can even set test values for variables (based on these proposed initial values, of course) by default within the codebase.

twiecki · 2021-03-11T19:37:21Z

Perfect.

…

On Thu, Mar 11, 2021, 19:57 Brandon T. Willard ***@***.***> wrote: While we over-rely on test-values I just want to add that they are nice for debugging a model. We definitely won't get rid of the ability to *use* test values; we simply won't rely on them within our codebase. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#4521 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFETGAHTB25KJY6PUN5653TDEABTANCNFSM4Y4UPFLA> .

brandonwillard · 2021-03-13T04:52:14Z

OK, I've added the un-transformed variables to the trace output.

…sses

brandonwillard · 2021-03-15T07:17:20Z

FYI: There are a lot of important fixes in this PR, so I'm probably going to merge it even if all the tests don't pass yet.

brandonwillard added the v4 label Mar 9, 2021

brandonwillard self-assigned this Mar 9, 2021

brandonwillard marked this pull request as draft March 9, 2021 22:47

brandonwillard force-pushed the implement-logp-transforms branch from 36286ea to cdd2852 Compare March 11, 2021 05:27

brandonwillard force-pushed the implement-logp-transforms branch from cf68232 to 99b1237 Compare March 11, 2021 05:49

brandonwillard added 2 commits March 10, 2021 23:50

Add fallback for missing value_var in BaseTrace

c07c709

Use name from Variable.name in pymc3.util.get_var_name

34a66c7

brandonwillard force-pushed the implement-logp-transforms branch from 99b1237 to 066330e Compare March 11, 2021 05:50

brandonwillard added this to the vNext (4.0.0) milestone Mar 11, 2021

brandonwillard force-pushed the implement-logp-transforms branch from 066330e to b0317c6 Compare March 11, 2021 05:59

ricardoV94 reviewed Mar 11, 2021

View reviewed changes

brandonwillard force-pushed the implement-logp-transforms branch from b0317c6 to 3a82747 Compare March 11, 2021 06:28

brandonwillard mentioned this pull request Mar 11, 2021

Consistent API for user friendly distribution customization #4530

Closed

brandonwillard force-pushed the implement-logp-transforms branch 2 times, most recently from ea2b815 to f101534 Compare March 13, 2021 04:41

Reinstate log-likelihood transforms

6c8a2b3

brandonwillard force-pushed the implement-logp-transforms branch from f101534 to 6c8a2b3 Compare March 14, 2021 05:20

brandonwillard added 13 commits March 13, 2021 23:28

Remove remaining v3 sampling code

7130193

Change logp_transform argument to the entire random variable

08393fb

Remove logpt transformed option

fdfb300

Implement transform for Dirichlet

b0c51ed

Always use the value var to initially build the log-likelihood

d802a0e

Register GammaRV to PositiveContinuous class

df7df00

Add an option for negative support assertions in Normal and Gamma cla…

06284f2

…sses

Make logp testing work with transformed values

3458eeb

Disable asserts during logp invalid range tests

376a711

Update xfails in pymc3.tests.test_distributions

b27f848

Fix Categorical logp implementation

86af023

Use the value var's values directly in logpt

cc09a09

Do not assume values are Applys in pymc3.tests.test_distributions

8977294

brandonwillard marked this pull request as ready for review March 15, 2021 07:17

brandonwillard merged commit eced3ef into pymc-devs:v4 Mar 15, 2021

brandonwillard deleted the implement-logp-transforms branch March 15, 2021 07:19

Uh oh!

Reinstate log-likelihood transforms #4521

Reinstate log-likelihood transforms #4521

Uh oh!

Conversation

brandonwillard commented Mar 9, 2021

Uh oh!

OriolAbril commented Mar 9, 2021

Uh oh!

brandonwillard commented Mar 9, 2021

Uh oh!

ricardoV94 commented Mar 10, 2021

Uh oh!

brandonwillard commented Mar 11, 2021

Uh oh!

ricardoV94 commented Mar 11, 2021

Uh oh!

brandonwillard commented Mar 11, 2021

Uh oh!

ricardoV94 Mar 11, 2021

Choose a reason for hiding this comment

Uh oh!

brandonwillard Mar 11, 2021

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Mar 11, 2021

Choose a reason for hiding this comment

Uh oh!

ricardoV94 commented Mar 11, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ricardoV94 commented Mar 11, 2021

Uh oh!

brandonwillard commented Mar 11, 2021

Uh oh!

twiecki commented Mar 11, 2021

Uh oh!

ricardoV94 commented Mar 11, 2021

Uh oh!

brandonwillard commented Mar 11, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

twiecki commented Mar 11, 2021 via email

Uh oh!

brandonwillard commented Mar 13, 2021

Uh oh!

brandonwillard commented Mar 15, 2021

Uh oh!

Uh oh!

ricardoV94 commented Mar 11, 2021 •

edited

Loading

brandonwillard commented Mar 11, 2021 •

edited

Loading