NeurIPS Poster Most Neural Networks Are Almost Learnable

Most Neural Networks Are Almost Learnable

[ Abstract ]

[ Paper] [ OpenReview]

Abstract: We present a PTAS for learning random constant-depth networks. We show that for any fixed

ϵ > 0

$\epsilon>0$ and depth

i

$i$ , there is a poly-time algorithm that for any distribution on

\sqrt{d} \cdot S^{d - 1}

$\sqrt{d} \cdot \mathbb{S}^{d-1}$ learns random Xavier networks of depth

i

$i$ , up to an additive error of

ϵ

$\epsilon$ . The algorithm runs in time and sample complexity of

(\bar{d})^{p o l y (ϵ^{- 1})}

$(\bar{d})^{\mathrm{poly}(\epsilon^{-1})}$ , where

\bar{d}

$\bar d$ is the size of the network. For some cases of sigmoid and ReLU-like activations the bound can be improved to

(\bar{d})^{p o l y l o g (ϵ^{- 1})}

$(\bar{d})^{\mathrm{polylog}(\epsilon^{-1})}$ , resulting in a quasi-poly-time algorithm for learning constant depth random networks.

Chat is not available.