Computation-Aware Kalman Filtering and Smoothing (2405.08971v1)
Abstract: Kalman filtering and smoothing are the foundational mechanisms for efficient inference in Gauss-Markov models. However, their time and memory complexities scale prohibitively with the size of the state space. This is particularly problematic in spatiotemporal regression problems, where the state dimension scales with the number of spatial observations. Existing approximate frameworks leverage low-rank approximations of the covariance matrix. Since they do not model the error introduced by the computational approximation, their predictive uncertainty estimates can be overly optimistic. In this work, we propose a probabilistic numerical method for inference in high-dimensional Gauss-Markov models which mitigates these scaling issues. Our matrix-free iterative algorithm leverages GPU acceleration and crucially enables a tunable trade-off between computational cost and predictive uncertainty. Finally, we demonstrate the scalability of our method on a large-scale climate dataset.
- Learning Latent Dynamics for Planning from Pixels. In International Conference on Machine Learning (ICML), 2019. doi:10.48550/arXiv.1811.04551. URL http://arxiv.org/abs/1811.04551.
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces, 2023. URL http://arxiv.org/abs/2312.00752.
- Bayesian Filtering and Smoothing, volume 17. Cambridge University Press, 2nd edition, 2023. ISBN 978-1-108-91230-3.
- Carl Edward Rasmussen and Christopher K. I. Williams. Gaussian Processes for Machine Learning. The MIT Press, 2006.
- Geir Evensen. Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics. Journal of Geophysical Research: Oceans, 99(C5):10143–10162, 1994. doi:10.1029/94JC00572.
- Krylov space approximate Kalman filtering. Numerical Linear Algebra with Applications, 20(2):171–184, December 2011. ISSN 1099-1506. doi:10.1002/nla.805. URL http://dx.doi.org/10.1002/nla.805.
- Data sketching for large-scale Kalman filtering. IEEE Transactions on Signal Processing, 65(14):3688–3701, July 2017. ISSN 1941-0476. doi:10.1109/tsp.2017.2691662. URL http://dx.doi.org/10.1109/tsp.2017.2691662.
- The Rank-Reduced Kalman Filter: Approximate Dynamical-Low-Rank Filtering In High Dimensions. In Advances in Neural Information Processing Systems (NeurIPS), 2023. doi:10.48550/arXiv.2306.07774. URL http://arxiv.org/abs/2306.07774.
- The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020. ISSN 1477-870X. doi:10.1002/qj.3803. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/qj.3803.
- Kalman filtering and smoothing solutions to temporal Gaussian process regression models. In IEEE International Workshop on Machine Learning for Signal Processing, pages 379–384, 2010. doi:10.1109/MLSP.2010.5589113.
- Spatiotemporal Learning via Infinite-Dimensional Bayesian Filtering and Smoothing: A Look at Gaussian Process Regression Through Kalman Filtering. IEEE Signal Processing Magazine, 30(4):51–61, 2013. ISSN 1558-0792. doi:10.1109/MSP.2013.2246292.
- Posterior and computational uncertainty in Gaussian processes. In Advances in Neural Information Processing Systems (NeurIPS), 2022.
- Testing whether a learning procedure is calibrated. Journal of Machine Learning Research, 23(203):1–36, 2022. URL http://jmlr.org/papers/v23/21-1065.html.
- L. Mirsky. Symmetric gauge functions and unitarily invariant norms. The Quarterly Journal of Mathematics, 11(1):50–59, 1960. ISSN 1464-3847. doi:10.1093/qmath/11.1.50. URL http://dx.doi.org/10.1093/qmath/11.1.50.
- A bayesian conjugate gradient method (with discussion). Bayesian Analysis, 14(3), September 2019a. ISSN 1936-0975. doi:10.1214/19-ba1145. URL http://dx.doi.org/10.1214/19-BA1145.
- Georges Matheron. Principles of geostatistics. Economic geology, 58(8):1246–1266, 1963. Publisher: Society of Economic Geologists.
- Random Features for Large-Scale Kernel Machines. In Advances in Neural Information Processing Systems (NeurIPS), 2007.
- Efficiently sampling functions from Gaussian process posteriors. In International Conference on Machine Learning (ICML), 2020.
- Fast matrix square roots with applications to gaussian processes and bayesian optimization. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems (NeurIPS), volume 33, pages 22268–22281. Curran Associates, Inc., 2020. URL https://proceedings.neurips.cc/paper_files/paper/2020/file/fcf55a303b71b84d326fb1d06e332a26-Paper.pdf.
- Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences, July 2018. URL http://arxiv.org/abs/1807.02582.
- On dimension reduction in gaussian filters. Inverse Problems, 32(4):045003, March 2016. ISSN 1361-6420. doi:10.1088/0266-5611/32/4/045003. URL http://dx.doi.org/10.1088/0266-5611/32/4/045003.
- Efficient spatio-temporal Gaussian regression via Kalman filtering. Automatica, 118, 2020. ISSN 0005-1098. doi:10.1016/j.automatica.2020.109032.
- Infinite-Dimensional Kalman Filtering Approach to Spatio-Temporal Gaussian Process Regression. In International Conference on Artificial Intelligence and Statistics (AISTATS), pages 993–1001, 2012. URL https://proceedings.mlr.press/v22/sarkka12.html.
- Infinite-dimensional Bayesian filtering for detection of quasi-periodic phenomena in spatio-temporal data. Physical Review E, 88(5), November 2013. ISSN 1539-3755, 1550-2376. doi:10.1103/PhysRevE.88.052909.
- Probabilistic numerics and uncertainty in computations. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 471(2179), 2015.
- Bayesian probabilistic numerical methods. SIAM Review, 61(4):756–789, 2019b.
- Probabilistic Numerics: Computation as Machine Learning. Cambridge University Press, 2022. ISBN 978-1-316-68141-1. doi:10.1017/9781316681411.
- Philipp Hennig. Probabilistic interpretation of linear solvers. SIAM Journal on Optimization, 25(1):234–260, January 2015. ISSN 1095-7189. doi:10.1137/140955501. URL http://dx.doi.org/10.1137/140955501.
- Probabilistic linear solvers for machine learning. In Advances in Neural Information Processing Systems (NeurIPS), 2020. URL https://github.com/JonathanWenger/probabilistic-linear-solvers-for-ml.
- Accelerating Generalized Linear Models by Trading off Computation for Uncertainty, 2023. URL http://arxiv.org/abs/2310.20285. arXiv:2310.20285 [cs, stat].
- Temporal Parallelization of Bayesian Smoothers. IEEE Transactions on Automatic Control, 66(1):299–306, 2020. doi:10.48550/arXiv.1905.13002. URL http://arxiv.org/abs/1905.13002.
- Arno Solin. Stochastic differential equation methods for spatio-temporal Gaussian process regression. PhD thesis, Aalto University, 2016.
- Spatio-Temporal Variational Gaussian Processes, 2021. URL http://arxiv.org/abs/2111.01732. arXiv:2111.01732 [cs, stat].
- Christopher M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag, 2006.
- Jean-François Le Gall. Brownian Motion, Martingales, and Stochastic Calculus, volume 274 of Graduate Texts in Mathematics. Springer International Publishing, Cham, 2016. doi:10.1007/978-3-319-31089-3.
- Nachman Aronszajn. Theory of reproducing kernels. Transactions of the American Mathematical Society, 68(3):337–404, 1950.
- Simo Särkkä. Recursive Bayesian Inference on Stochastic Differential Equations. PhD thesis, Helsinki University of Technology, 2006.
- Applied Stochastic Differential Equations. Cambridge University Press, 1 edition, 2019. ISBN 978-1-108-18673-5. doi:10.1017/9781108186735.
- Randomized numerical linear algebra: Foundations and algorithms. Acta Numerica, 29:403–572, May 2020. ISSN 1474-0508. doi:10.1017/s0962492920000021. URL http://dx.doi.org/10.1017/S0962492920000021.
- James O. Berger. Statistical Decision Theory. Springer New York, 1980. ISBN 9781475717273. doi:10.1007/978-1-4757-1727-3. URL http://dx.doi.org/10.1007/978-1-4757-1727-3.
- Yousef Saad. Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, January 2003. ISBN 9780898718003. doi:10.1137/1.9780898718003. URL http://dx.doi.org/10.1137/1.9780898718003.
- Krylov subspace methods. Numerical Mathematics and Scientific Computation. Oxford University Press, London, England, December 2012.