Add linalg.block_diag and sparse equivalent #576


Merged: 16 commits into pymc-devs:main on Jan 7, 2024

Conversation

@jessegrabowski (Member) commented on Jan 6, 2024

Description

There are a number of generic math-type operations in pymc.math that make more sense in pytensor. This PR moves one of them, block_diag, to pytensor.tensor.slinalg. There are others that could likely be moved as well.

block_diag uses a few support functions that also present opportunities to implement some missing numpy functions in pytensor. I rewrote largest_common_dtype to use np.promote_types, but there should probably be a pt.promote_types and a pt.result_type; the latter could replace this helper function entirely.
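For reference, a doctest-style sketch of the numpy behavior such helpers would mirror (not code from this PR):

    >>> import numpy as np
    >>> np.promote_types(np.float32, np.int16)
    dtype('float32')
    >>> np.result_type(np.int32, np.float32)
    dtype('float64')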

There was also a function ix, which is an implementation of np.ix_. We're missing all of these numpy quick constructors: np.r_, np.c_, np.ogrid, np.mgrid. Plus np.meshgrid, which is an actual function people use.

floatX and intX should probably also be pytensor functions instead of PyMC functions.

I tagged this as relevant to #573 because I'm interested in experimenting with linear algebra rewrites out of the block_diag Op.

Related Issue

Checklist

Type of change

  • New feature / enhancement
  • Bug fix
  • Documentation
  • Maintenance
  • Other (please specify):

@codecov-commenter commented on Jan 6, 2024

Codecov Report

Attention: 24 lines in your changes are missing coverage. Please review.

Comparison is base (e180927) 80.92% compared to head (a9893b8) 80.92%.
Report is 4 commits behind head on main.

Additional details and impacted files


@@           Coverage Diff            @@
##             main     #576    +/-   ##
========================================
  Coverage   80.92%   80.92%            
========================================
  Files         162      162            
  Lines       46524    46644   +120     
  Branches    11375    11401    +26     
========================================
+ Hits        37648    37746    +98     
- Misses       6653     6668    +15     
- Partials     2223     2230     +7     
Files Coverage Δ
pytensor/link/jax/dispatch/slinalg.py 93.54% <100.00%> (+1.24%) ⬆️
pytensor/tensor/basic.py 88.32% <100.00%> (-0.15%) ⬇️
pytensor/sparse/basic.py 82.57% <81.81%> (+0.08%) ⬆️
pytensor/tensor/slinalg.py 93.70% <85.00%> (-0.94%) ⬇️
pytensor/link/numba/dispatch/slinalg.py 45.60% <17.64%> (-3.20%) ⬇️

... and 6 files with indirect coverage changes

@ricardoV94 (Member) commented on Jan 6, 2024

The floatX functions are actually pretty crappy and should be phased out completely. For instance they fail when you pass a python list of TensorVariables.

PyTensor has tools (including config flags) to handle type promotion; these should be relied upon instead, or refactored if they don't satisfy our current needs:

https://pytensor.readthedocs.io/en/latest/library/config.html#config.cast_policy
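For reference, a minimal way to inspect the flags in question (illustrative only; see the linked docs for the accepted values):

    import pytensor

    # Both flags live on pytensor.config; their values depend on your configuration.
    print(pytensor.config.floatX)       # e.g. 'float64' or 'float32'
    print(pytensor.config.cast_policy)  # controls how numeric types are promoted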

@ricardoV94 (Member) left a review:

Looks good, just some small tweaks.

@ricardoV94 (Member) commented:

Also, are the JAX/Numba dispatches trivial enough that we could support them already?

@ricardoV94 (Member) commented on Jan 6, 2024

One thing I don't know is whether mixing the sparse and dense in a single Op is the best internal API. Maybe that's fine.

I guess it mostly depends on whether rewrites can ignore this info or whether they will have to always reason differently depending on it.

However, I think the helper function should follow scipy API, so there would be a tensor.linalg.block_diag that builds the dense case (or in this case can just use a pre-built Op) and a tensor.sparse.block_diag for the sparse cases.

@jessegrabowski (Member, Author) commented on Jan 6, 2024

I think the numba/jax dispatches should be easy; I'll work on them. The other thing would be to make it Blockwise.

With respect to the API, I totally agree about splitting sparse and dense into pytensor.tensor.slinalg.block_diag and pytensor.sparse.block_diag. I will also refactor the function to match the scipy signature, which takes the matrices as args instead of a single list.
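For illustration, a minimal sketch of the dense entry point after that refactor (import path taken from the discussion above; treat it as an assumption and check the merged code):

    import numpy as np
    from pytensor.tensor.slinalg import block_diag

    A = np.eye(2)
    B = np.ones((3, 3))

    # scipy-style signature: matrices are passed as separate arguments,
    # not as a single list.
    C = block_diag(A, B)
    print(C.eval().shape)  # (5, 5)

The sparse counterpart discussed above would live under pytensor.sparse and return a scipy.sparse-backed result.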

@ricardoV94 (Member) commented:

Can you open an issue in PyMC to deprecate the code there?

Commit: Closely follow scipy function signature for `block_diag`
@jessegrabowski (Member, Author) commented:
I'm going to need some handholding with turning it into Blockwise. I think the gfunc_sig should just be "(n,m)->(o,p)", but there are an arbitrary number of inputs all of the same size, so do I need to represent that somehow?

@ricardoV94 (Member) commented on Jan 6, 2024

Signatures can be defined dynamically when the Op is initialized; that's how we do it for solve.

For this Op you can do something like we do for safe_signature:

    signature = "(m0,n0),(m1,n1),...(mn,nn)->(m,n)"

You can add the number of inputs as a parameter of the Op, so it's known at initialization.
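A minimal sketch of that idea (a standalone helper for illustration; the names here are assumptions, not necessarily what the PR ends up using):

    # Hypothetical helper: build the gufunc signature from the number of
    # input matrices, which is known when the Op is initialized.
    def block_diag_signature(n_inputs: int) -> str:
        inputs = ",".join(f"(m{i},n{i})" for i in range(n_inputs))
        return f"{inputs}->(m,n)"

    print(block_diag_signature(3))  # (m0,n0),(m1,n1),(m2,n2)->(m,n)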

The inputs don't have to have the same shape, right, just the same ndim? E.g. foo([[0]], [[1,2,3],[4,5,6]])?

@jessegrabowski (Member, Author) commented on Jan 6, 2024

Yes, you're right about sizes.

Do I need to rewrite the jax/numba overloads to handle batch dims, or is that just unsupported so far?

@ricardoV94 (Member) commented on Jan 6, 2024

The JAX Blockwise will work via vmap; we only need to dispatch the base case, like you did.

That is exactly the point of Blockwise: since the batch dims always work the same way, we only need to specify the core case.

Numba doesn't have Blockwise support yet. We should be able to do something simple with guvectorize, although it has limitations, such as only working with single outputs.
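To make the vmap point concrete, here is a hedged sketch of the core-vs-batched relationship in plain JAX (not the actual dispatch code in this PR):

    import jax
    import jax.numpy as jnp
    from jax.scipy.linalg import block_diag  # core (non-batched) case

    # Blockwise-style batching is just vmap over a shared leading batch axis.
    batched = jax.vmap(block_diag, in_axes=(0, 0))

    A = jnp.ones((5, 2, 2))
    B = jnp.ones((5, 3, 3))
    print(batched(A, B).shape)  # (5, 5, 5)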

@jessegrabowski (Member, Author) commented:

I guess we can't blockwise Sparse Ops? Blockwise uses pt.as_tensor, which breaks if the inputs are sparse. Will need to add some logic to handle that case. This would be the first function it would work with, I think. Otherwise, though, I think this is done.

@ricardoV94 (Member) left a review:

Only nitpicks left, address what you want and close what you don't!

Review thread on this docstring excerpt:

    Parameters
    ----------
    A, B, C ... : tensors
        Input matrices to form the block diagonal matrix. Each matrix should have the same number of dimensions, and the

@ricardoV94 (Member): This is not correct. Blockwise accepts a different number of batch dims and also broadcasts when they have length 1.

@jessegrabowski (Member, Author) commented on Jan 6, 2024:

I got errors when I tried tensors with different batch dims, but I didn't try broadcasting to dimensions with size 1.

@ricardoV94 (Member):

Do you still see errors? Blockwise should introduce expand dims, so the only failure case would be broadcasting?

@jessegrabowski (Member, Author):

Broadcasting works; I added a test for it. It was failing when I tried different batch sizes, which I don't think makes sense anyway.

@ricardoV94 (Member):

What do you mean different batch sizes? Blockwise adds expand dims automatically to align the number of batch dims, so that shouldn't be possible?

@jessegrabowski (Member, Author) commented on Jan 7, 2024:

This errors:

    # Different batch sizes
    A = np.random.normal(size=(batch_size + 3, 2, 2)).astype(config.floatX)
    B = np.random.normal(size=(batch_size, 4, 4)).astype(config.floatX)
    result = block_diag(A, B).eval()

with:

E           ValueError: Incompatible Blockwise batch input shapes [(8, 2, 2), (5, 4, 4)]

But I think it's supposed to error. What does it even mean to batch those two together?

@ricardoV94 (Member) commented on Jan 7, 2024:

Yeah, that's invalid: batch shapes must be broadcastable, and 8 and 5 are not broadcastable.

I thought you were saying inputs with different numbers of dimensions were failing.
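For reference, a sketch of the batching rules described above (shapes chosen for illustration, assuming the merged block_diag is wrapped in Blockwise as discussed; not taken from the test suite):

    import numpy as np
    from pytensor.tensor.slinalg import block_diag

    B = np.random.normal(size=(5, 4, 4))   # one batch dim of length 5

    # Different numbers of batch dims: Blockwise left-pads with expand_dims.
    A = np.eye(2)                          # no batch dims
    print(block_diag(A, B).eval().shape)   # (5, 6, 6)

    # Length-1 batch dims broadcast against longer ones.
    A = np.random.normal(size=(1, 2, 2))
    print(block_diag(A, B).eval().shape)   # (5, 6, 6)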

@ricardoV94 (Member), replying to the sparse Blockwise question above:

We don't have a sparse type with ndim != 2, so that's only the tip of the iceberg.

@ricardoV94 changed the title from "Migrate generic pytensor functions from pymc.math to pytensor" to "Add linalg.block_diag and sparse equivalent" on Jan 7, 2024
Commits:
  • Remove `Matrix` from `BlockDiagonal` and `SparseBlockDiagonal` `Op` names
  • Correct errors in docstrings
  • Move input validation to a shared class method
@ricardoV94 (Member) left a review:

Looks pretty neat! Two not-so-nitpick comments.

Also, nice catch with the missing test for grad.

@jessegrabowski merged commit c4ae6e3 into pymc-devs:main on Jan 7, 2024
@jessegrabowski deleted the block-diag branch on January 7, 2024 at 03:39