
pass names from RV dims to change_rv_size #5931


Closed

Conversation

flo-schu

@canyon289 to resolve #5669, I have made some small changes, mainly to aesaraf.py, to include the dim names in the new_size tensor (6585f90)

This mostly solves the issue, and test_model runs without errors. There are a few things that probably still need to be addressed:

  • do dims passed to change_rv_size need to be tested more rigorously?
  • is None a good default name for the broadcasting tensor dimension?
  • should the name include information about whether the broadcasting comes from observed data or other processes?
  • things that I'm not aware of :)

Changes in this PR (a rough sketch follows the list):

  • adds a new_size_dims argument to change_rv_size in aesaraf.py
  • checks whether dims is None and wraps it in a tuple if the RV has no dim
  • creates new_size_name from the dim name plus decorations
  • passes new_size_name as the name argument to the at.as_tensor call

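For concreteness, a minimal sketch of the naming step (hypothetical and simplified; the dim and size values are made up, but the as_tensor call mirrors the one changed in this PR):

import aesara.tensor as at

new_size_dims = ("cities",)   # dim name(s) attached to the RV, or (None,)
new_size = (5,)               # the resolved resizing tuple

# build a readable name from the dim name(s); fall back to no name
new_size_name = "_".join(d for d in new_size_dims if d is not None) or None

# pass the name on to as_tensor so it shows up in dprint output
new_size = at.as_tensor(new_size, ndim=1, dtype="int64", name=new_size_name)
print(new_size.name)  # "cities"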
@@ -180,9 +187,13 @@ def change_rv_size(
size = shape[: len(shape) - rv_node.op.ndim_supp]
new_size = tuple(new_size) + tuple(size)

# create the name of the RV's resizing tensor
# TODO: add information about where the dim is coming from (observed, prior, ...)
Member

Is this TODO going to be implemented in this PR, or is it for later?

Author

Yes, I think it would be helpful. I just wanted to make sure I'm on the right track before I spend more time on it.

@canyon289
Member

Thanks much for the PR!
Are you planning to add tests?

@codecov

codecov bot commented Jun 26, 2022

Codecov Report

Merging #5931 (6585f90) into main (be048a4) will decrease coverage by 0.92%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##             main    #5931      +/-   ##
==========================================
- Coverage   89.53%   88.60%   -0.93%     
==========================================
  Files          73       73              
  Lines       13267    13270       +3     
==========================================
- Hits        11878    11758     -120     
- Misses       1389     1512     +123     
Impacted Files                        Coverage Δ
pymc/aesaraf.py                       92.13% <100.00%> (+0.05%) ⬆️
pymc/distributions/distribution.py    90.83% <100.00%> (ø)
pymc/variational/updates.py           37.93% <0.00%> (-54.19%) ⬇️
pymc/backends/arviz.py                86.53% <0.00%> (-4.09%) ⬇️
pymc/step_methods/hmc/base_hmc.py     89.76% <0.00%> (-0.79%) ⬇️
pymc/model.py                         87.79% <0.00%> (-0.27%) ⬇️

@@ -145,6 +145,7 @@ def convert_observed_data(data):
def change_rv_size(
rv: TensorVariable,
new_size: PotentialShapeType,
new_size_dims: Optional[tuple] = (None,),
Member
@ricardoV94 ricardoV94 Jun 26, 2022

The name info should not be handled by this function. I think somewhere in the model we create constants or shared variables for dim sizes, and that is where we should set the names.

Author

Could you give me a hint on roughly where this should happen? Before or after the call to change_rv_size?

Member
@ricardoV94 ricardoV94 Jun 27, 2022

This could be done here, where the dims variables are created:

pymc/pymc/model.py

Lines 1146 to 1149 in be048a4

if mutable:
length = aesara.shared(length)
else:
length = aesara.tensor.constant(length)

Both constants and shared variables accept names during construction.
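A minimal sketch of that suggestion (hypothetical example values; in the real code the dim name is already in scope where the length variable is created):

import aesara

name, length, mutable = "cities", 5, False  # hypothetical example values

# attach the dim name when the length variable is created,
# so it survives into later graph printouts
if mutable:
    length = aesara.shared(length, name=name)
else:
    length = aesara.tensor.constant(length, name=name)

print(length.name)  # "cities"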

Author
@flo-schu flo-schu Jul 4, 2022

@ricardoV94, I looked into it, and when I set the names in the suggested lines, I still get only the following output from dprint:

normal_rv{0, (0, 0), floatX, False}.1 [id A] 'x'
 |RandomGeneratorSharedVariable(<Generator(PCG64) at 0x7F99BD4B6960>) [id B]
 |TensorConstant{(1,) of 5} [id C]
 |TensorConstant{11} [id D]
 |TensorConstant{0} [id E]
 |TensorConstant{1.0} [id F]

Only after passing the name to new_size in aesaraf.py::change_rv_size do I get the change. Changing

new_size = at.as_tensor(new_size, ndim=1, dtype="int64")

to

new_size = at.as_tensor(new_size, ndim=1, dtype="int64", name=new_size[0].name)

yields:

normal_rv{0, (0, 0), floatX, False}.1 [id A] 'x'
 |RandomGeneratorSharedVariable(<Generator(PCG64) at 0x7FAD2928A960>) [id B]
 |cities_dim{(1,) of 5} [id C]
 |TensorConstant{11} [id D]
 |TensorConstant{0} [id E]
 |TensorConstant{1.0} [id F]

If this is okay, I'll implement the changes in this PR

Member
@ricardoV94 ricardoV94 Oct 27, 2022

This PR added a name for the mutable case, but not the constant one: f45ca4a. I think adding it to the constant case is all that's needed.

Author

Thanks, I will try to give it another go this week.

Author

@ricardoV94, I've been dancing around this PR for quite a while now, but I'm not finding the time to do it. At the beginning of next year I would have more time, but if it bothers you to have a PR open for such a long time, or if someone else takes over, I won't mind not getting to finish it.

Member

No worries. The original issue lost some of its relevance when we switched the default from resizable dims to constant dims. It's still nice to have, but not urgent by any means. Feel free to come back to it whenever you find the time.

@ricardoV94
Member

This PR should now be a bit more straightforward due to #6063: dims will now always be used when shape is not provided.

@ricardoV94 ricardoV94 mentioned this pull request Aug 29, 2022
@covertg

covertg commented Feb 24, 2023

Hi folks, I was interested in this, so I took a look with the latest codebase. As @flo-schu found, it's not sufficient to simply add the dim name to the call to pytensor.tensor.constant here

pymc/pymc/model.py

Lines 1066 to 1069 in e1d36ca

if mutable:
length = pytensor.shared(length, name=name)
else:
length = pytensor.tensor.constant(length)

Somewhere along the RV creation call stack, the name of the Variable holding the dim length gets lost. I believe this is actually happening in pytensor.tensor.random.utils.normalize_size_parameter(), here: https://github.com/pymc-devs/pytensor/blob/a1a805c9119e986be9f3e08591daf3698cede0ef/pytensor/tensor/random/utils.py#L136

With shared variables, the call to as_tensor_variable returns a MakeVector, which preserves the original variable(s) holding dim lengths. But with constant variables, as_tensor_variable squashes the dim length(s) into one TensorConstant and wipes out the names.
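A minimal repro of that difference (a sketch; the exact dprint text may vary by PyTensor version):

import pytensor
import pytensor.tensor as pt

# a shared variable cannot be folded, so as_tensor_variable builds a
# MakeVector around it and the named variable stays in the graph
s = pytensor.shared(5, name="kept")
pytensor.dprint(pt.as_tensor_variable((s,)))
# MakeVector{dtype='int64'} [id A]
#  |kept [id B]

# a constant gets folded into one fresh TensorConstant; the name is gone
c = pt.constant(5, name="lost")
print(pt.as_tensor_variable((c,)).name)  # None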

Using the following model,

with pm.Model() as m:
    mutable = True
    d1 = pm.Data("data1", range(5), dims="d1", mutable=mutable)
    d2 = pm.Data("data2", range(7), dims="d2", mutable=mutable)
    x = pm.Normal("x", dims=("d1", "d2"))

Before that line in normalize_size_parameter, dprint(size) shows

Subtensor{int64} [id A]
 |Shape [id B]
 | |data1 [id C]
 |ScalarConstant{0} [id D]
Subtensor{int64} [id E]
 |Shape [id F]
 | |data2 [id G]
 |ScalarConstant{0} [id H]

and after,

MakeVector{dtype='int64'} [id A]
 |Subtensor{int64} [id B]
 | |Shape [id C]
 | | |data1 [id D]
 | |ScalarConstant{0} [id E]
 |Subtensor{int64} [id F]
   |Shape [id G]
   | |data2 [id H]
   |ScalarConstant{0} [id I]

With pymc patched as described above to pass along the name to pytensor.tensor.constant when creating dims, we can try the same model but with mutable=False. That line in normalize_size_parameter changes

d1{5} [id A]
d2{7} [id B]

to

TensorConstant{[5 7]} [id A]

So, what to do? If this issue is still of interest, I'd be glad to work on it given some advice.

@covertg

covertg commented Feb 24, 2023

N.B. the TensorConstant name disappears even when there is only one dim, à la

>>> pt.as_tensor_variable((pt.constant(5, name="disappears"),)).name
None

This is because normalize_size_parameter always receives a tuple.

A couple of naive thoughts:

  • We could pre-empt pytensor's (re)creation of the TensorConstant by always turning the dim length information into a more complex Variable, somewhere close to the lines below. For example, stack/join create_size before the call to rv_op so that the dim lengths always end up in a MakeVector (see the sketch after this list).
    • create_size = find_size(shape=shape, size=size, ndim_supp=ndim_supp)
      rv_out = cls.rv_op(*dist_params, size=create_size, **kwargs)
    • But we might break things if parts of the codebase rely on the shape information being a TensorConstant.
  • Or perhaps allow the TensorConstants to get squashed into one, but preserve the name information by concatenating the original names where available (this would be a pytensor change).
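A sketch of the first idea (hypothetical; it uses pt.stack, which is what actually produces a MakeVector for scalar lengths, since pt.join would need the scalars padded to 1-d first):

import pytensor
import pytensor.tensor as pt

# named dim-length constants, as a patched pymc would create them
d1 = pt.constant(5, name="d1", dtype="int64")
d2 = pt.constant(7, name="d2", dtype="int64")

# stacking keeps the named constants as separate inputs of a MakeVector,
# so they are not folded into one anonymous TensorConstant at this point
size = pt.stack([d1, d2])
pytensor.dprint(size)
# MakeVector{dtype='int64'} [id A]
#  |d1{5} [id B]
#  |d2{7} [id C]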

@ricardoV94
Member

ricardoV94 commented Feb 26, 2023

@covertg thanks for investigating. This issue was more relevant before, when all dims used SharedVariables and hence names were preserved. With TensorConstants, PyTensor is very liberal and happy to return objects that aren't linked to the original ones, as you found out. I think we can simply close the PR and issue. It was a nice-to-have, but it doesn't seem worth any further trouble at this point.

@ricardoV94 ricardoV94 closed this Feb 26, 2023
Development

Successfully merging this pull request may close these issues.

Add variable names to size from dims