Tensor cumulants for statistical inference on invariant distributions (2404.18735v1)
Abstract: Many problems in high-dimensional statistics appear to have a statistical-computational gap: a range of values of the signal-to-noise ratio where inference is information-theoretically possible, but (conjecturally) computationally intractable. A canonical such problem is Tensor PCA, where we observe a tensor $Y$ consisting of a rank-one signal plus Gaussian noise. Multiple lines of work suggest that Tensor PCA becomes computationally hard at a critical value of the signal's magnitude. In particular, below this transition, no low-degree polynomial algorithm can detect the signal with high probability; conversely, various spectral algorithms are known to succeed above this transition. We unify and extend this work by considering tensor networks, orthogonally invariant polynomials where multiple copies of $Y$ are "contracted" to produce scalars, vectors, matrices, or other tensors. We define a new set of objects, tensor cumulants, which provide an explicit, near-orthogonal basis for invariant polynomials of a given degree. This basis lets us unify and strengthen previous results on low-degree hardness, giving a combinatorial explanation of the hardness transition and of a continuum of subexponential-time algorithms that work below it, and proving tight lower bounds against low-degree polynomials for recovering rather than just detecting the signal. It also lets us analyze a new problem of distinguishing between different tensor ensembles, such as Wigner and Wishart tensors, establishing a sharp computational threshold and giving evidence of a new statistical-computational gap in the Central Limit Theorem for random tensors. Finally, we believe these cumulants are valuable mathematical objects in their own right: they generalize the free cumulants of free probability theory from matrices to tensors, and share many of their properties, including additivity under additive free convolution.
- Random matrices and complexity of spin glasses. Communications on Pure and Applied Mathematics, 66(2):165–201, 2013.
- On the heat equation and the index theorem. Inventiones Math., 19:279–330, 1973.
- Homotopy analysis for tensor PCA. In Conference on Learning Theory, pages 79–104. PMLR, 2017.
- Finite free cumulants: multiplicative convolutions, genus expansion and infinitesimal distributions. Transactions of the American Mathematical Society, 376(06):4383–4420, 2023.
- Random tensor theory: extending random matrix theory to mixtures of random product states. Communications in Mathematical Physics, 310(1):25–74, 2012.
- Graph matrices: norm bounds and applications. arXiv preprint arXiv:1604.03423, 2016.
- Cumulants for finite free convolution. Journal of Combinatorial Theory, Series A, 155:244–266, 2018.
- Teodor Banica. The orthogonal Weingarten formula in compact form. Letters in Mathematical Physics, 91(2):105–118, 2010.
- Reducibility and statistical-computational gaps from secret leakage. In 33rd Annual Conference on Learning Theory (COLT 2020), pages 648–847. PMLR, 2020.
- On the Fourier coefficients of high-dimensional random geometric graphs. arXiv preprint arXiv:2402.12589, 2024.
- De Finetti-style results for Wishart matrices: Combinatorial structure and phase transitions. arXiv preprint arXiv:2103.14011, 2021.
- Phase transitions for detecting latent geometry in random graphs. Probability Theory and Related Fields, 178(3):1215–1289, 2020.
- How to iron out rough landscapes and get optimal performances: averaged gradient descent and its application to tensor PCA. Journal of Physics A: Mathematical and Theoretical, 53(17):174003, 2020.
- Testing for high-dimensional geometry in random graphs. Random Structures & Algorithms, 49(3):503–532, 2016.
- The Franz-Parisi criterion and computational trade-offs in high dimensional statistics. Advances in Neural Information Processing Systems, 35:33831–33844, 2022.
- Multiplicative approximations for polynomial optimization over the unit sphere. Electron. Colloquium Comput. Complex., TR16-185, 2016.
- Sum-of-squares certificates for maxima of random tensors on the sphere. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2017, volume 81 of LIPIcs, pages 31:1–31:20. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2017.
- Universality in p𝑝pitalic_p-spin glasses with correlated disorder. Journal of Statistical Mechanics: Theory and Experiment, 2013(02):L02003, 2013.
- A nearly tight sum-of-squares lower bound for the planted clique problem. SIAM Journal on Computing, 48(2):687–735, 2019.
- Reconfiguration of graphs with connectivity constraints. In International Workshop on Approximation and Online Algorithms, pages 295–309. Springer, 2018.
- Béla Bollobás. The asymptotic number of unlabelled regular graphs. Journal of the London Mathematical Society, 2(2):201–206, 1982.
- Remi Bonnin. Universality of the Wigner-Gurau limit for random tensors. arXiv preprint arxiv:2404.14144, 2024.
- Richard Brauer. On algebras which are connected with the semisimple continuous groups. Annals of Mathematics, 38(4):857–872, 1937.
- All real eigenvalues of symmetric tensors. SIAM Journal on Matrix Analysis and Applications, 35(4):1582–1601, 2014.
- The tensor Harish-Chandra–Itzykson–Zuber integral II: Detecting entanglement in large quantum systems. Communications in Mathematical Physics, pages 1–48, 2023.
- The tensor Harish-Chandra–Itzykson–Zuber integral I: Weingarten calculus and a generalization of monotone Hurwitz numbers. Journal of the European Mathematical Society, 2023.
- Second order freeness and fluctuations of random matrices, III. Higher order freeness and free cumulants. Documenta Mathematica, 12, 07 2006.
- Statistical and computational phase transitions in group testing. In Conference on Learning Theory, pages 4764–4781. PMLR, 2022.
- Benoît Collins. Moments and cumulants of polynomial random variables on unitary groups, the Itzykson-Zuber integral, and free probability. International Mathematics Research Notices, 2003(17):953–982, 2003.
- Integration with respect to the haar measure on unitary, orthogonal and symplectic group. Communications in Mathematical Physics, 264(3):773–795, 2006.
- Fast algorithm for overcomplete order-3 tensor decomposition. arXiv preprint arXiv:2202.06442, 2022.
- Low-degree hardness of detection for correlated Erdős-Rényi graphs. arXiv preprint arXiv:2311.15931, 2023.
- Estimating rank-one spikes from heavy-tailed noise via self-avoiding walks. arXiv preprint arXiv:2008.13735, 2020.
- Detection of dense subhypergraphs by low-degree polynomials. arXiv preprint arXiv:2304.08135, 2023.
- Oleg Evnin. Melonic dominance and the largest eigenvalue of a large random tensor. Letters in Mathematical Physics, 111(3):66, 2021.
- S. Geršgorin. Über die abgrenzung der eigenwerte einer matrix. Bulletin de l’Académie des Sciences de l’URSS. Classe des sciences mathématiques et na, pages 749–754, 1931.
- Sum-of-squares lower bounds for Sherrington-Kirkpatrick via planted affine planes. In 61st Annual Symposium on Foundations of Computer Science (FOCS 2020), pages 954–965. IEEE, 2020.
- Hardness of random optimization problems for Boolean circuits, low-degree polynomials, and Langevin dynamics. SIAM Journal on Computing, 53(1):1–46, 2024.
- Razvan Gurau. Universality for random tensors. In Annales de l’IHP Probabilités et statistiques, volume 50, pages 1474–1525, 2014.
- Răzvan Gheorghe Gurău. Random tensors. Oxford University Press, 2017.
- Razvan Gurau. On the generalization of the Wigner semicircle law to real symmetric tensors. arXiv preprint arXiv:2004.02660, 2020.
- Representations and Invariants of the Classical Groups. Cambridge University Press, 1998.
- Seifollah Louis Hakimi. On realizability of a set of integers as degrees of the vertices of a linear graph II. Uniqueness. Journal of the Society for Industrial and Applied Mathematics, 11(1):135–147, 1963.
- Matthew B Hastings. Classical and quantum algorithms for tensor principal component analysis. Quantum, 4:237, 2020.
- The power of sum-of-squares for detecting hidden structures. In 58th Annual Symposium on Foundations of Computer Science (FOCS), pages 720–731. IEEE, 2017.
- Samuel Hopkins. Statistical inference and the sum of squares method. PhD thesis, Cornell University, 2018.
- Asymptotic formulaæ in combinatory analysis. Proceedings of the London Mathematical Society, 2(1):75–115, 1918.
- Efficient Bayesian estimation from few samples: community detection and related problems. In 58th Annual Symposium on Foundations of Computer Science (FOCS), pages 379–390. IEEE, 2017.
- Tensor principal component analysis via sum-of-square proofs. In Peter Grünwald, Elad Hazan, and Satyen Kale, editors, Proc. 28th Conference on Learning Theory, volume 40 of Proceedings of Machine Learning Research, pages 956–1006, 2015.
- Fast spectral algorithms from sum-of-squares proofs: tensor decomposition and planted sparse vectors. In Proceedings of the Forty-Eighth Annual ACM Symposium on Theory of Computing, pages 178––191, 2016.
- L. Isserlis. On a formula for the product-moment coefficient of any order of a normal frequency distribution in any number of variables. Biometrika, 12:134–139, 1918.
- Statistical thresholds for tensor PCA. Annals of Applied Probability, 30(4):1910–1933, 2020.
- Almost-orthogonal bases for inner product polynomials. arXiv preprint arXiv:2107.00216, 2021.
- Sum-of-squares lower bounds for densest k𝑘kitalic_k-subgraph. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 84–95, 2023.
- Is planted coloring easier than planted clique? In The Thirty Sixth Annual Conference on Learning Theory, pages 5343–5372. PMLR, 2023.
- Notes on computational hardness of hypothesis testing: Predictions using the low-degree likelihood ratio. In Paula Cerejeiras and Michael Reissig, editors, Mathematical Analysis, its Applications and Computation, pages 1–50, Cham, 2022. Springer International Publishing.
- Testing thresholds for high-dimensional sparse random geometric graphs. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pages 672–677, 2022.
- Laurent Massoulié. Community detection thresholds and the weak ramanujan property. In Proceedings of the forty-sixth annual ACM symposium on Theory of computing, pages 694–703, 2014.
- High-temperature expansions and message passing algorithms. Journal of Statistical Mechanics: Theory and Experiment, 2019(11):113301, 2019.
- Dan Mikulincer. A CLT in Stein’s distance for generalized Wishart matrices and higher order tensors. arXiv preprint arXiv:2002.10846, 2020.
- A critical point for random graphs with a given degree sequence. Random Structures & Algorithms, 6(2-3):161–180, 1995.
- A graph integral formulation of the circuit partition polynomial. Combinatorics, Probability & Computing, 20(6):911, 2011.
- A statistical model for tensor PCA. In Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2, NIPS’14, page 2897–2905. MIT Press, 2014.
- Free Probability and Random Matrices, volume 35 of Fields Institute Monographs. Springer, 2017.
- Spectral methods from tensor networks. In 51st Annual ACM SIGACT Symposium on Theory of Computing (STOC 2019), pages 926–937, 2019.
- Lectures on the combinatorics of free probability, volume 13. Cambridge University Press, 2006.
- What is…a free cumulant? Notices of the American Mathematical Society, 58:300–301, 2011.
- A new framework for tensor PCA based on trace invariants. 2020.
- Random tensor theory for tensor decomposition. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7):7913–7921, 2022.
- Mohamed Ouerfelli. New perspectives and tools for Tensor Principal Component Analysis and beyond. PhD thesis, Université Paris-Saclay, 2022.
- Sub-exponential time sum-of-squares lower bounds for principal components analysis. Advances in Neural Information Processing Systems, 35:35724–35740, 2022.
- Claudio Procesi. The invariant theory of n×n𝑛𝑛n\times nitalic_n × italic_n matrices. Advances in mathematics, 19(3):306–381, 1976.
- Tensor eigenvalues and their applications, volume 39. Springer, 2018.
- Liqun Qi. Eigenvalues of a real supersymmetric tensor. Journal of Symbolic Computation, 40(6):1302–1324, 2005.
- Liqun Qi. Eigenvalues and invariants of tensors. Journal of Mathematical Analysis and Applications, 325(2):1363–1377, 2007.
- Strongly refuting random CSPs below the spectral threshold. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, pages 121–131, 2017.
- Strongly refuting random CSPs below the spectral threshold. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, page 121–131, 2017.
- Is it easier to count communities than find them? In 14th Innovations in Theoretical Computer Science Conference (ITCS 2023). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2023.
- Guilhem Semerjian. Matrix denoising: Bayes-optimal estimators via low-degree polynomials. arXiv preprint arxiv:2402.16719, 2024.
- Eliran Subag. The complexity of spherical p𝑝pitalic_p-spin models — a second moment approach. The Annals of Probability, 45(5):3385–3450, 2017.
- Computational barriers to estimation from low-degree polynomials. The Annals of Statistics, 50(3):1833–1858, 2022.
- The Kikuchi hierarchy and tensor PCA. In 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, pages 1446–1468. IEEE Computer Society, 2019.
- Alexander S Wein. Average-case complexity of tensor decomposition for low-degree polynomials. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing (STOC 2023), pages 1685–1698, 2023.
- Hans Wenzl. On the structure of Brauer’s centralizer algebras. Annals of Mathematics, 128(1):173–193, 1988.
- Hermann Weyl. The Classical Groups: Their Invariants and Representations. 1946.
- G. C. Wick. The evaluation of the collision matrix. Physical Review, 80:268–272, 1950.
- Todd G Will. Switching distance between graphs with the same degrees. SIAM Journal on Discrete Mathematics, 12(3):298–306, 1999.
- Sharp analysis of power iteration for tensor PCA. arXiv preprint arXiv:2401.01047, 2024.
- Paul Zinn-Justin. Jucys-Murphy elements and Weingarten matrices. arXiv preprint arXiv:0907.2719, 2009.