Papers
Topics
Authors
Recent
Detailed Answer
Quick Answer
Concise responses based on abstracts only
Detailed Answer
Well-researched responses based on abstracts and relevant paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses
Gemini 2.5 Flash
Gemini 2.5 Flash 77 tok/s
Gemini 2.5 Pro 33 tok/s Pro
GPT-5 Medium 25 tok/s Pro
GPT-5 High 27 tok/s Pro
GPT-4o 75 tok/s Pro
Kimi K2 220 tok/s Pro
GPT OSS 120B 465 tok/s Pro
Claude Sonnet 4 36 tok/s Pro
2000 character limit reached

Nonasymptotic one-and two-sample tests in high dimension with unknown covariance structure (2109.01730v2)

Published 1 Sep 2021 in cs.LG, cs.AI, math.ST, stat.ML, and stat.TH

Abstract: Let $\mathbf{X} = (X_i){1\leq i \leq n}$ be an i.i.d. sample of square-integrable variables in $\mathbb{R}d$, \GB{with common expectation $\mu$ and covariance matrix $\Sigma$, both unknown.} We consider the problem of testing if $\mu$ is $\eta$-close to zero, i.e. $|\mu| \leq \eta $ against $|\mu| \geq (\eta + \delta)$; we also tackle the more general two-sample mean closeness (also known as {\em relevant difference}) testing problem. The aim of this paper is to obtain nonasymptotic upper and lower bounds on the minimal separation distance $\delta$ such that we can control both the Type I and Type II errors at a given level. The main technical tools are concentration inequalities, first for a suitable estimator of $|\mu|2$ used a test statistic, and secondly for estimating the operator and Frobenius norms of $\Sigma$ coming into the quantiles of said test statistic. These properties are obtained for Gaussian and bounded distributions. A particular attention is given to the dependence in the pseudo-dimension $d$ of the distribution, defined as $d_ := |\Sigma|22/|\Sigma|\infty2$. In particular, for $\eta=0$, the minimum separation distance is ${\Theta}( d_*{\frac{1}{4}}\sqrt{|\Sigma|_\infty/n})$, in contrast with the minimax estimation distance for $\mu$, which is ${\Theta}(d_e{\frac{1}{2}}\sqrt{|\Sigma|_\infty/n})$ (where $d_e:=|\Sigma|1/|\Sigma|\infty$). This generalizes a phenomenon spelled out in particular by Baraud (2002).

Citations (2)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-Up Questions

We haven't generated follow-up questions for this paper yet.