High-Dimensional Differentially-Private EM Algorithm: Methods and Near-Optimal Statistical Guarantees (2104.00245v2)

Published 1 Apr 2021 in stat.ML, cs.CR, cs.LG, math.ST, stat.ME, and stat.TH

Abstract: In this paper, we develop a general framework to design differentially private expectation-maximization (EM) algorithms in high-dimensional latent variable models, based on the noisy iterative hard-thresholding. We derive the statistical guarantees of the proposed framework and apply it to three specific models: Gaussian mixture, mixture of regression, and regression with missing covariates. In each model, we establish the near-optimal rate of convergence with differential privacy constraints, and show the proposed algorithm is minimax rate optimal up to logarithm factors. The technical tools developed for the high-dimensional setting are then extended to the classic low-dimensional latent variable models, and we propose a near rate-optimal EM algorithm with differential privacy guarantees in this setting. Simulation studies and real data analysis are conducted to support our results.

Citations (3)

View on Semantic Scholar