Switch to sqrt(precision) representation in Gaussian #568
Conversation
@eb8680 could we pair code on Gaussian patterns this week? I think this PR now has correct linear algebra, but it produces slightly different patterns. Specifically, the new square root representation can no longer be negated, so we'll need to keep …
Whoa, impressive work to make this possible! I haven't looked into the code yet, but my general concern would be the marginalization and compress-rank logic. I will look into the details later this week.
My initial concerns regarding compress rank and marginalization seem to be already resolved. The math seems correct and the implementation is quite optimal in my opinion. `white_vec` seems natural given the implementation, but I'm curious why you use it instead of `info_vec`?
```python
def eager_neg(op, arg):
    info_vec = -arg.info_vec
    precision = -arg.precision
    return Gaussian(info_vec, precision, arg.inputs)
```
Just curious: previously, when was `sub` needed? And how do you compute `Gaussian - Gaussian`? (I couldn't derive the math :( )
I believe `sub` is needed only in computing KL divergences, e.g. in fully Gaussian ELBOs we compute `Integrate(guide, model - guide)` where both `model` and `guide` are Gaussian. Actually we'll need some more patterns now that `ops.neg(Gaussian(...))` is lazy; as discussed with @eb8680, I'll open an issue once this merges.
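(For intuition, a hedged sketch of why negation has no eager form in the sqrt parametrization, writing $S$ for `prec_sqrt`: negating a Gaussian flips the sign of its precision, but a negated precision has no real square root:

$$
-\,\mathrm{precision} \;=\; -\,S S^\top \;\preceq\; 0,
\qquad\text{while}\qquad
T T^\top \;\succeq\; 0 \quad \text{for every real } T,
$$

so no real `prec_sqrt` $T$ satisfies $T T^\top = -\,S S^\top$ unless $S = 0$. The negation must stay lazy until it cancels against another term, as in `model - guide`.)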
As we group-coded on Friday, full subtraction will be implemented in the follow-up PR #553.
Thanks for reviewing, @fehiepsi! Here are a few weak reasons I chose `white_vec`: …
Generally looks great. Once AutoGaussian is finished I think we should consider writing a short technical report on the internals of `funsor.Gaussian`, since there's a lot of cool stuff here and it wasn't really covered at all in the Funsor paper.
```python
if prec_sqrt is not None:
    is_tril = False
elif precision is not None:
    prec_sqrt = ops.cholesky(precision)
```
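For context, a minimal sketch of the identity this conversion relies on, in plain NumPy rather than funsor's backend-generic `ops` (the example matrix is illustrative):

```python
import numpy as np

# A Cholesky factor is one valid precision square root: any S with
# S @ S.T == precision parametrizes the same Gaussian, and the Cholesky
# choice is lower triangular (hence the is_tril bookkeeping above).
precision = np.array([[4.0, 2.0], [2.0, 3.0]])  # symmetric positive definite
S = np.linalg.cholesky(precision)
assert np.allclose(S @ S.T, precision)
```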
This conversion logic in `GaussianMeta` seems fine overall, but I wonder whether we should look for a way to restrict actually performing the possibly expensive linear algebra computations to specific interpretations like `eager`. If so, that might be another reason to attempt #556, since we could just make them lazy `Op`s.
Thanks for reviewing, @eb8680 and @fehiepsi! I believe I've addressed all comments. I plan to add more patterns in subsequent PRs that handle Gaussian variable elimination, e.g. in #553 and https://github.com/pyro-ppl/funsor/tree/tractable-for-gaussians
Looks great to me on the second pass! Thanks, @fritzo!
Looks great!
Resolves #567
Adapts @fehiepsi's pyro-ppl/pyro#2019
This switches the internal Gaussian representation to a numerically stable and space-efficient square-root parametrization, `(white_vec, prec_sqrt)`. In the new parametrization, Gaussians represent the log-density function sketched below.
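Writing $S$ for `prec_sqrt` and $w$ for `white_vec`, the density presumably has the form (a sketch reconstructed from the parameter names used in this PR; the exact normalization convention is an assumption):

$$
\log p(x) \;\propto\; -\tfrac{1}{2}\,\bigl\| x^\top S - w^\top \bigr\|^2
\;=\; -\tfrac{1}{2}\, x^\top S S^\top x \;+\; w^\top S^\top x \;-\; \tfrac{1}{2}\,\|w\|^2,
$$

so the classical parameters are recovered as $\mathrm{precision} = S S^\top$ and $\mathrm{info\_vec} = S\,w$.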
These two parameters are shaped to efficiently represent low-rank data, reducing space complexity from `O(dim(dim+1))` to `O(rank(dim+1))`; a storage sketch follows below. In my real-world example rank=1, dim=2369, and batch_shape=(1343,), so the space reduction is 30GB → 13MB.
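A minimal storage sketch, assuming shapes `white_vec: batch_shape + (rank,)` and `prec_sqrt: batch_shape + (dim, rank)` (these shapes are inferred from the complexity claims above, not quoted from the PR), reproducing the numbers in the example:

```python
# Storage comparison at float32: old (info_vec, precision) versus
# new (white_vec, prec_sqrt), using the real-world sizes quoted above.
# Assumed shapes:
#   info_vec : batch + (dim,)     precision : batch + (dim, dim)
#   white_vec: batch + (rank,)    prec_sqrt : batch + (dim, rank)
batch, dim, rank = 1343, 2369, 1

old_numel = batch * (dim + dim * dim)    # O(dim(dim+1)) per batch element
new_numel = batch * (rank + dim * rank)  # O(rank(dim+1)) per batch element

print(f"old: {4 * old_numel / 1e9:.0f} GB")  # old: 30 GB
print(f"new: {4 * new_numel / 1e6:.0f} MB")  # new: 13 MB
```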
Computation is cheap in this representation: addition amounts to concatenation, and plate-reduction amounts to transpose and reshape. Some ops are only supported on full-rank Gaussians, and I've added checks based on the new property `.is_full_rank`. This partial support is OK because the Gaussian funsors that arise in Bayesian models are all full rank due to priors (notwithstanding numerical loss of rank).

Because the Gaussian funsor is internal, the interface change should not break most user code, since most user code uses `to_funsor()` and `to_data()` with backend-specific distributions. One broken piece of user code is Pyro's AutoGaussianFunsor, which will need an update (and which will be sped up).

As suggested by @eb8680, I've added some optional kwarg parametrizations and properties to support conversion to other Gaussian representations, e.g.
`g = Gaussian(mean=..., covariance=..., inputs=...)` and `g._mean`, `g._covariance`. This allows more Gaussian math to live in gaussian.py.

Tested