NeurIPS Poster Adversarial Training is a Form of Data-dependent Operator Norm Regularization

Poster

Adversarial Training is a Form of Data-dependent Operator Norm Regularization

Kevin Roth · Yannic Kilcher · Thomas Hofmann

Poster Session 3 #910

[ Abstract ] [ Paper PDF ]

[ Paper ]

Abstract: We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks. Specifically, we prove that

l_{p}

$l_p$ -norm constrained projected gradient ascent based adversarial training with an

l_{q}

$l_q$ -norm loss on the logits of clean and perturbed inputs is equivalent to data-dependent (p, q) operator norm regularization. This fundamental connection confirms the long-standing argument that a network’s sensitivity to adversarial examples is tied to its spectral properties and hints at novel ways to robustify and defend against adversarial attacks. We provide extensive empirical evidence on state-of-the-art network architectures to support our theoretical results.

Chat is not available.