NeurIPS Poster Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time

Poster

Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time

Jerry Li · Guanghao Ye

Poster Session 2 #708

[ Abstract ] [ Paper PDF ]

[ Paper ]

Abstract: Robust covariance estimation is the following, well-studied problem in high dimensional statistics: given

N

$N$ samples from a

d

$d$ -dimensional Gaussian

N (0, Σ)

$\mathcal{N}(\boldsymbol{0}, \Sigma)$ , but where an

ε

$\varepsilon$ -fraction of the samples have been arbitrarily corrupted, output

\hat{Σ}

$\widehat{\Sigma}$ minimizing the total variation distance between

N (0, Σ)

$\mathcal{N}(\boldsymbol{0}, \Sigma)$ and

N (0, \hat{Σ})

$\mathcal{N}(\boldsymbol{0}, \widehat{\Sigma})$ . This corresponds to learning

Σ

$\Sigma$ in a natural affine-invariant variant of the Frobenius norm known as the \emph{Mahalanobis norm}. Previous work of Cheng et al demonstrated an algorithm that, given

N = \tilde{Ω} (d^{2} / ε^{2})

$N = \widetilde{\Omega}(d^2 / \varepsilon^2)$ samples, achieved a near-optimal error of

O (ε \log 1 / ε)

$O(\varepsilon \log 1 / \varepsilon)$ , and moreover, their algorithm ran in time

\tilde{O} (T (N, d) \log κ / p o l y (ε))

$\widetilde{O}(T(N, d) \log \kappa / \mathrm{poly} (\varepsilon))$ , where

T (N, d)

$T(N, d)$ is the time it takes to multiply a

d \times N

$d \times N$ matrix by its transpose, and

κ

$\kappa$ is the condition number of

Σ

$\Sigma$ . When

ε

$\varepsilon$ is relatively small, their polynomial dependence on

1 / ε

$1/\varepsilon$ in the runtime is prohibitively large. In this paper, we demonstrate a novel algorithm that achieves the same statistical guarantees, but which runs in time

\tilde{O} (T (N, d) \log κ)

$\widetilde{O} (T(N, d) \log \kappa)$ . In particular, our runtime has no dependence on

ε

$\varepsilon$ . When

Σ

$\Sigma$ is reasonably conditioned, our runtime matches that of the fastest algorithm for covariance estimation without outliers, up to poly-logarithmic factors, showing that we can get robustness essentially

for free.''

Chat is not available.