General Gaussian Noise Mechanisms and Their Optimality for Unbiased Mean Estimation (2301.13850v2)
Abstract: We investigate unbiased high-dimensional mean estimators in differential privacy. We consider differentially private mechanisms whose expected output equals the mean of the input dataset, for every dataset drawn from a fixed bounded $d$-dimensional domain $K$. A classical approach to private mean estimation is to compute the true mean and add unbiased, but possibly correlated, Gaussian noise to it. In the first part of this paper, we study the optimal error achievable by a Gaussian noise mechanism for a given domain $K$ when the error is measured in the $\ell_p$ norm for some $p \ge 2$. We give algorithms that compute the optimal covariance for the Gaussian noise for a given $K$ under suitable assumptions, and prove a number of nice geometric properties of the optimal error. These results generalize the theory of factorization mechanisms from domains $K$ that are symmetric and finite (or, equivalently, symmetric polytopes) to arbitrary bounded domains. In the second part of the paper we show that Gaussian noise mechanisms achieve nearly optimal error among all private unbiased mean estimation mechanisms in a very strong sense. In particular, for every input dataset, an unbiased mean estimator satisfying concentrated differential privacy introduces approximately at least as much error as the best Gaussian noise mechanism. We extend this result to local differential privacy, and to approximate differential privacy, but for the latter the error lower bound holds either for a dataset or for a neighboring dataset, and this relaxation is necessary.
- Instance-optimality in differential privacy via approximate inverse sensitivity mechanisms. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020.
- Near instance-optimality in differential privacy. CoRR, abs/2005.10630, 2020.
- Optimal algorithms for mean estimation under local differential privacy. In International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pages 1046â1056. PMLR, 2022.
- PLAN: variance-aware private mean estimation. CoRR, abs/2306.08745, 2023.
- Approximating the cut-norm via Grothendieckâs inequality. In ACM Symposium on Theory of Computing, pages 72â80, 2004.
- T. Ando. Concavity of certain maps on positive definite matrices and applications to Hadamard products. Linear Algebra Appl., 26:203â241, 1979.
- Towards instance-optimal private query release. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms. SODA 2019, pages 2480â2497. SIAM, Philadelphia, PA, 2019.
- Unconditional differentially private mechanisms for linear queries. In Howard J. Karloff and Toniann Pitassi, editors, Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19 - 22, 2012, pages 1269â1284. ACM, 2012.
- A framework for quadratic form maximization over convex sets through nonconvex relaxations. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC â21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 870â881. ACM, 2021.
- Concentrated differential privacy: Simplifications, extensions, and lower bounds. In Theory of Cryptography - 14th International Conference, TCC 2016-B, Beijing, China, October 31 - November 3, 2016, Proceedings, Part I, volume 9985 of Lecture Notes in Computer Science, pages 635â658, 2016.
- Private empirical risk minimization: Efficient algorithms and tight error bounds. In 55th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2014, Philadelphia, PA, USA, October 18-21, 2014, pages 464â473. IEEE Computer Society, 2014.
- Fingerprinting codes and the price of approximate differential privacy. In David B. Shmoys, editor, Symposium on Theory of Computing, STOC 2014, New York, NY, USA, May 31 - June 03, 2014, pages 1â10. ACM, 2014.
- Eric Carlen. Trace inequalities and quantum entropy: an introductory course. In Entropy and the quantum, volume 529 of Contemp. Math., pages 73â140. Amer. Math. Soc., Providence, RI, 2010.
- Multi-epoch matrix factorization mechanisms for private machine learning. CoRR, abs/2211.06530, 2022.
- Minimum variance estimation without regularity assumptions. Ann. Math. Statistics, 22:581â586, 1951.
- Minimax optimal procedures for locally private estimation. J. Amer. Statist. Assoc., 113(521):182â201, 2018.
- Our data, ourselves: Privacy via distributed noise generation. In Advances in Cryptology - EUROCRYPT 2006, 25th Annual International Conference on the Theory and Applications of Cryptographic Techniques, St. Petersburg, Russia, May 28 - June 1, 2006, Proceedings, pages 486â503, 2006.
- Subset-based instance optimality in private estimation. In Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett, editors, International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA, volume 202 of Proceedings of Machine Learning Research, pages 7992â8014. PMLR, 2023.
- Differentially private covariance revisited. CoRR, abs/2205.14324, 2022.
- Calibrating noise to sensitivity in private data analysis. In Proceedings of the Third Conference on Theory of Cryptography, TCCâ06, pages 265â284, Berlin, Heidelberg, 2006. Springer-Verlag.
- Efficient algorithms for privately releasing marginals via convex relaxations. Discrete Comput. Geom., 53(3):650â673, 2015.
- Concentrated differential privacy. CoRR, abs/1603.01887, 2016.
- The right complexity measure in locally private estimation: It is not the fisher information. CoRR, abs/1806.05756, 2018.
- Wei Dong and Ke Yi. A nearly instance-optimal differentially private mechanism for conjunctive queries. In PODS â22: International Conference on Management of Data, Philadelphia, PA, USA, June 12 - 17, 2022, pages 213â225. ACM, 2022.
- Measure theory and fine properties of functions. Studies in Advanced Mathematics. CRC Press, Boca Raton, FL, 1992.
- Limiting privacy breaches in privacy preserving data mining. In PODS, pages 211â222. ACM, 2003.
- The power of factorization mechanisms in local and central differential privacy. In STOCâ20âProceedings of the 52n Annual ACM SIGACT Symposium on Theory of Computing, pages 425â438. ACM, 2020.
- Alexandre Grothendieck. Résumé de la théorie métrique des produits tensoriels topologiques. Bol. Soc. Mat. Sao Paulo, 8(1-79):88, 1953.
- J. M. Hammersley. On estimating restricted parameters. J. Roy. Statist. Soc. Ser. B, 12:192â229; discussion, 230â240, 1950.
- Instance-optimal mean estimation under differential privacy. In MarcâAurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, pages 25993â26004, 2021.
- Constant matters: Fine-grained complexity of differentially private continual observation using completely bounded norms. CoRR, abs/2202.11205, 2022.
- Almost tight error bounds on differentially private continual counting. CoRR, abs/2211.05006, 2022.
- What can we learn privately? In FOCS, pages 531â540. IEEE, Oct 25â28 2008.
- A bias-variance-privacy trilemma for statistical estimation. CoRR, abs/2301.13334, 2023.
- Grothendieck-type inequalities in combinatorial optimization. Comm. Pure Appl. Math., 65(7):992â1035, 2012.
- Hidetoshi Komiya. Elementary proof for Sionâs minimax theorem. Kodai Math. J., 11(1):5â7, 1988.
- The composition theorem for differential privacy. IEEE Trans. Inf. Theory, 63(6):4037â4049, 2017.
- J. L. Krivine. ThĂ©orĂšmes de factorisation dans les espaces rĂ©ticulĂ©s. In SĂ©minaire Maurey-Schwartz 1973â1974: Espaces LpsuperscriptđżđL^{p}italic_L start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, applications radonifiantes et gĂ©omĂ©trie des espaces de Banach, pages Exp. Nos. 22 et 23, 22. Ăcole Polytech., Paris, 1974.
- On the âsemanticsâ of differential privacy: A bayesian formulation. J. Priv. Confidentiality, 6(1), 2014.
- Finite sample differentially private confidence intervals. In Anna R. Karlin, editor, 9th Innovations in Theoretical Computer Science Conference, ITCS 2018, January 11-14, 2018, Cambridge, MA, USA, volume 94 of LIPIcs, pages 44:1â44:9. Schloss Dagstuhl - Leibniz-Zentrum fĂŒr Informatik, 2018.
- Optimizing linear counting queries under differential privacy. In Proceedings of the 29th ACM Symposium on Principles of Database Systems, PODSâ10, pages 123â134. ACM, 2010.
- The matrix mechanism: optimizing linear counting queries under differential privacy. VLDB J., 24(6):757â781, 2015.
- A direct product theorem for discrepancy. In Proceedings of the 23rd Annual IEEE Conference on Computational Complexity, CCC 2008, 23-26 June 2008, College Park, Maryland, USA, pages 71â80. IEEE Computer Society, 2008.
- Optimizing error of high-dimensional statistical queries under differential privacy. Proc. VLDB Endow., 11(10):1206â1219, 2018.
- Factorization norms and hereditary discrepancy. Int. Math. Res. Not. IMRN, 2020(3):751â780, 2020.
- Private online prefix sums via optimal matrix factorizations. CoRR, abs/2202.08312, 2022.
- Instance-optimal differentially private estimation. CoRR, abs/2210.15819, 2022.
- Yu. Nesterov. Semidefinite relaxation and nonconvex quadratic optimization. Optim. Methods Softw., 9(1-3):141â160, 1998.
- Aleksandar Nikolov. New Computational Aspects of Discrepancy Theory. PhD thesis, Rutgers, The State University of New Jersey, 2014.
- Aleksandar Nikolov. Private query release via the johnson-lindenstrauss transform. In Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, SODA 2023, Florence, Italy, January 22-25, 2023, pages 4982â5002. SIAM, 2023.
- Efficient rounding for the noncommutative Grothendieck inequality. Theory Comput., 10:257â295, 2014.
- The geometry of differential privacy: the sparse and approximate cases. In STOCâ13âProceedings of the 2013 ACM Symposium on Theory of Computing, pages 351â360. ACM, New York, 2013.
- The geometry of differential privacy: The small database and approximate cases. SIAM J. Comput., 45(2):575â616, 2016.
- Gilles Pisier. Grothendieckâs theorem for noncommutative Câsuperscriptđ¶âC^{\ast}italic_C start_POSTSUPERSCRIPT â end_POSTSUPERSCRIPT-algebras, with an appendix on Grothendieckâs constants. J. Functional Analysis, 29(3):397â415, 1978.
- R. Tyrrell Rockafellar. Convex analysis. Princeton Mathematical Series, No. 28. Princeton University Press, Princeton, N.J., 1970.
- Mathematical statistics with applications. Elsevier/Academic Press, Amsterdam, 2009.
- Maurice Sion. On general minimax theorems. Pacific J. Math., 8:171â176, 1958.
- N. Tomczak-Jaegermann. Banach-Mazur Distances and Finite-Dimensional Operator Ideals. Pitman Monographs and Surveys in Pure and Applied Mathematics 38. J. Wiley, New York, 1989.
- Roman Vershynin. High-dimensional probability, volume 47 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge, 2018. An introduction with applications in data science, With a foreword by Sara van de Geer.
- Unbiased estimators and their applications. Vol. 1, volume 263 of Mathematics and its Applications. Kluwer Academic Publishers, Dordrecht, 1993. Univariate case, Translated from the 1989 Russian original by L. E. Strautman and revised by the authors.
- Unbiased estimators and their applications. Vol. 2, volume 362 of Mathematics and its Applications. Kluwer Academic Publishers Group, Dordrecht, 1996. Multivariate case.
- Stanley L. Warner. Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60(309):63â69, 1965.
- An optimal and scalable matrix mechanism for noisy marginals under convex loss functions. CoRR, abs/2305.08175, 2023.
- Privacy and bias analysis of disclosure avoidance systems. CoRR, abs/2301.12204, 2023.
- Bias and variance of post-processing in differential privacy. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Virtual Event, February 2-9, 2021, pages 11177â11184. AAAI Press, 2021.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.