NeurIPS Poster Efficient active learning of sparse halfspaces with arbitrary bounded noise

Poster

Efficient active learning of sparse halfspaces with arbitrary bounded noise

Chicheng Zhang · Jie Shen · Pranjal Awasthi

Poster Session 1 #516

[ Abstract ] [ Paper PDF ]

[ Paper ]

Abstract: We study active learning of homogeneous

s

$s$ -sparse halfspaces in

R^{d}

$\mathbb{R}^d$ under the setting where the unlabeled data distribution is isotropic log-concave and each label is flipped with probability at most

η

$\eta$ for a parameter

η \in [0, \frac{1}{2})

$\eta \in \big[0, \frac12\big)$ , known as the bounded noise. Even in the presence of mild label noise, i.e.

η

$\eta$ is a small constant, this is a challenging problem and only recently have label complexity bounds of the form

~ O (s \cdot p o l y l o g (d, \frac{1}{ϵ}))

$\tilde{O}(s \cdot polylog(d, \frac{1}{\epsilon}))$ been established in [Zhang 2018] for computationally efficient algorithms. In contrast, under high levels of label noise, the label complexity bounds achieved by computationally efficient algorithms are much worse: the best known result [Awasthi et al. 2016] provides a computationally efficient algorithm with label complexity

~ O ((s l n d / ϵ)^{p o l y (1 / (1 - 2 η))})

$\tilde{O}((s ln d/\epsilon)^{poly(1/(1-2\eta))})$ , which is label-efficient only when the noise rate

η

$\eta$ is a fixed constant. In this work, we substantially improve on it by designing a polynomial time algorithm for active learning of

s

$s$ -sparse halfspaces, with a label complexity of

~ O (\frac{s}{(1 - 2 η)^{4}} p o l y l o g (d, \frac{1}{ϵ}))

$\tilde{O}\big(\frac{s}{(1-2\eta)^4} polylog (d, \frac 1 \epsilon) \big)$ . This is the first efficient algorithm with label complexity polynomial in

\frac{1}{1 - 2 η}

$\frac{1}{1-2\eta}$ in this setting, which is label-efficient even for

η

$\eta$ arbitrarily close to

\frac{1}{2}

$\frac12$ . Our active learning algorithm and its theoretical guarantees also immediately translate to new state-of-the-art label and sample complexity results for full-dimensional active and passive halfspace learning under arbitrary bounded noise.

Chat is not available.