NeurIPS Poster Feature Adaptation for Sparse Linear Regression

Spotlight Poster

Feature Adaptation for Sparse Linear Regression

Jonathan Kelner · Frederic Koehler · Raghu Meka · Dhruv Rohatgi

Great Hall & Hall B1+B2 (level 1) #1726

[ Abstract ]

[ Paper] [ OpenReview]

Abstract: Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian

N (0, Σ)

$N(0,\Sigma)$ , and we seek an estimator with small excess risk. If the true signal is

t

$t$ -sparse, information-theoretically, it is possible to achieve strong recovery guarantees with only

O (t \log n)

$O(t\log n)$ samples. However, computationally efficient algorithms have sample complexity linear in (some variant of) the *condition number* of

Σ

$\Sigma$ . Classical algorithms such as the Lasso can require significantly more samples than necessary even if there is only a single sparse approximate dependency among the covariates.We provide a polynomial-time algorithm that, given

Σ

$\Sigma$ , automatically adapts the Lasso to tolerate a small number of approximate dependencies. In particular, we achieve near-optimal sample complexity for constant sparsity and if

Σ

$\Sigma$ has few

outlier'' eigenvalues.Our algorithm fits into a broader framework of *feature adaptation* for sparse linear regression with ill-conditioned covariates. With this framework, we additionally provide the first polynomial-factor improvement over brute-force search for constant sparsity

t

$t$ and arbitrary covariance

Σ

$\Sigma$ .

Chat is not available.