NeurIPS Poster Computing Approximate $\ell

Poster

Computing Approximate $\ell_p$ Sensitivities

Swati Padmanabhan · David Woodruff · Richard Zhang

Great Hall & Hall B1+B2 (level 1) #1224

[ Abstract ]

[ Paper] [ Poster] [ OpenReview]

Abstract: Recent works in dimensionality reduction for regression tasks have introduced the notion of sensitivity, an estimate of the importance of a specific datapoint in a dataset, offering provable guarantees on the quality of the approximation after removing low-sensitivity datapoints via subsampling. However, fast algorithms for approximating sensitivities, which we show is equivalent to approximate regression, are known for only the

ℓ_{2}

$\ell_2$ setting, in which they are popularly termed leverage scores. In this work, we provide the first efficient algorithms for approximating

ℓ_{p}

$\ell_p$ sensitivities and other summary statistics of a given matrix. In particular, for a given

n \times d

$n \times d$ matrix, we compute

α

$\alpha$ -approximation to its

ℓ_{1}

$\ell_1$ sensitivities at the cost of

n / α

$n/\alpha$ sensitivity computations. For estimating the total

ℓ_{p}

$\ell_p$ sensitivity (i.e. the sum of

ℓ_{p}

$\ell_p$ sensitivities), we provide an algorithm based on importance sampling of

ℓ_{p}

$\ell_p$ Lewis weights, which computes a constant factor approximation at the cost of roughly

\sqrt{d}

$\sqrt{d}$ sensitivity computations, with no polynomial dependence on

n

$n$ . Furthermore, we estimate the maximum

ℓ_{1}

$\ell_1$ sensitivity up to a

\sqrt{d}

$\sqrt{d}$ factor in

O (d)

$O(d)$ sensitivity computations. We also generalize these results to

ℓ_{p}

$\ell_p$ norms. Lastly, we experimentally show that for a wide class of structured matrices in real-world datasets, the total sensitivity can be quickly approximated and is significantly smaller than the theoretical prediction, demonstrating that real-world datasets have on average low intrinsic effective dimensionality.

Chat is not available.

Poster

Computing Approximate ℓpℓp\ell_p Sensitivities

Swati Padmanabhan · David Woodruff · Richard Zhang

Great Hall & Hall B1+B2 (level 1) #1224

Computing Approximate $\ell_p$ Sensitivities