NeurIPS Poster Differentially Private Generalized Linear Models Revisited

Poster

Differentially Private Generalized Linear Models Revisited

Raman Arora · Raef Bassily · Cristóbal Guzmán · Michael Menart · Enayat Ullah

Hall J (level 1) #818

Keywords: [ supervised learning ] [ generalized linear model ] [ Optimization ] [ differential privacy ]

[ Abstract ]

[ Paper] [ Poster] [ OpenReview]

Abstract: We study the problem of

(ϵ, δ)

$(\epsilon,\delta)$ -differentially private learning of linear predictors with convex losses. We provide results for two subclasses of loss functions. The first case is when the loss is smooth and non-negative but not necessarily Lipschitz (such as the squared loss). For this case, we establish an upper bound on the excess population risk of

~ O (\frac{∥ w^{*} ∥}{\sqrt{n}} + min {\frac{∥ w^{*} ∥^{2}}{(n ϵ)^{2 / 3}}, \frac{\sqrt{d} ∥ w^{*} ∥^{2}}{n ϵ}})

$\tilde{O}\left(\frac{\Vert w^*\Vert}{\sqrt{n}} + \min\left\{\frac{\Vert w^* \Vert^2}{(n\epsilon)^{2/3}},\frac{\sqrt{d}\Vert w^*\Vert^2}{n\epsilon}\right\}\right)$ , where

n

$n$ is the number of samples,

d

$d$ is the dimension of the problem, and

w^{*}

$w^*$ is the minimizer of the population risk. Apart from the dependence on

∥ w^{*} ∥

$\Vert w^\ast\Vert$ , our bound is essentially tight in all parameters. In particular, we show a lower bound of

~ Ω (\frac{1}{\sqrt{n}} + min {\frac{∥ w^{*} ∥^{4 / 3}}{(n ϵ)^{2 / 3}}, \frac{\sqrt{d} ∥ w^{*} ∥}{n ϵ}})

$\tilde{\Omega}\left(\frac{1}{\sqrt{n}} + {\min\left\{\frac{\Vert w^*\Vert^{4/3}}{(n\epsilon)^{2/3}}, \frac{\sqrt{d}\Vert w^*\Vert}{n\epsilon}\right\}}\right)$ . We also revisit the previously studied case of Lipschitz losses \cite{SSTT21}. For this case, we close the gap in the existing work and show that the optimal rate is (up to log factors)

Θ (\frac{∥ w^{*} ∥}{\sqrt{n}} + min {\frac{∥ w^{*} ∥}{\sqrt{n ϵ}}, \frac{\sqrt{rank} ∥ w^{*} ∥}{n ϵ}})

$\Theta\left(\frac{\Vert w^*\Vert}{\sqrt{n}} + \min\left\{\frac{\Vert w^*\Vert}{\sqrt{n\epsilon}},\frac{\sqrt{\text{rank}}\Vert w^*\Vert}{n\epsilon}\right\}\right)$ , where

rank

$\text{rank}$ is the rank of the design matrix. This improves over existing work in the high privacy regime. Finally, our algorithms involve a private model selection approach that we develop to enable attaining the stated rates without a-priori knowledge of

∥ w^{*} ∥

$\Vert w^*\Vert$ .

Chat is not available.