Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models (1902.00194v4)

Published 1 Feb 2019 in math.ST, cs.LG, stat.ML, and stat.TH

Abstract: We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i.i.d. samples are known to have lower accuracy than the classical $n{- \frac{1}{2}}$ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We provide a rigorous characterization of EM for fitting a weakly identifiable Gaussian mixture in a univariate setting where we prove that the EM algorithm converges in order $n{\frac{3}{4}}$ steps and returns estimates that are at a Euclidean distance of order ${ n{- \frac{1}{8}}}$ and ${ n{-\frac{1} {4}}}$ from the true location and scale parameter respectively. Establishing the slow rates in the univariate setting requires a novel localization argument with two stages, with each stage involving an epoch-based argument applied to a different surrogate EM operator at the population level. We demonstrate several multivariate ($d \geq 2$) examples that exhibit the same slow rates as the univariate case. We also prove slow statistical rates in higher dimensions in a special case, when the fitted covariance is constrained to be a multiple of the identity.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Raaz Dwivedi (29 papers)
  2. Nhat Ho (126 papers)
  3. Koulik Khamaru (21 papers)
  4. Martin J. Wainwright (141 papers)
  5. Michael I. Jordan (438 papers)
  6. Bin Yu (168 papers)
Citations (31)

Summary

We haven't generated a summary for this paper yet.