

Poster in Workshop: Machine Learning with New Compute Paradigms

Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo

Ziyi Wang · Yujie Chen · Ruqi Zhang · Qifan Song

[ Project Page ]
Sat 16 Dec 9:25 a.m. PST — 10:30 a.m. PST

Abstract: Low-precision training has emerged as a promising low-cost technique to enhance the training efficiency of deep neural networks without sacrificing much accuracy. Its Bayesian counterpart can further provide uncertainty quantification and improved generalization accuracy. This paper investigates low-precision samplers via Stochastic Gradient Hamiltonian Monte Carlo (SGHMC) with low-precision and full-precision gradient accumulators for both strongly log-concave and non-log-concave distributions. Theoretically, our results show that, to achieve $\epsilon$-error in the 2-Wasserstein distance for non-log-concave distributions, low-precision SGHMC achieves a quadratic improvement ($\tilde{\mathcal{O}}\left(\epsilon^{-2}{\mu^*}^{-2}\log^2\left(\epsilon^{-1}\right)\right)$) over the state-of-the-art low-precision sampler, Stochastic Gradient Langevin Dynamics (SGLD) ($\tilde{\mathcal{O}}\left(\epsilon^{-4}{\lambda^*}^{-1}\log^5\left(\epsilon^{-1}\right)\right)$). Moreover, we prove that low-precision SGHMC is more robust to quantization error than low-precision SGLD due to the robustness of the momentum-based update with respect to gradient noise. Empirically, we conduct experiments on synthetic data and the MNIST, CIFAR-10, and CIFAR-100 datasets, which validate our theoretical findings. Our study highlights the potential of low-precision SGHMC as an efficient and accurate sampling method for large-scale and resource-limited deep learning.
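
To make the setting concrete, below is a minimal Python sketch of one low-precision SGHMC step in one plausible variant: the parameters are stored on a low-precision grid via stochastic rounding, while the momentum buffer serves as a full-precision accumulator. The quantizer, step size, friction parameter, and grid spacing are illustrative assumptions for exposition, not the authors' exact algorithm or hyperparameters.

# Minimal sketch of a low-precision SGHMC step (illustrative assumptions,
# not the paper's exact algorithm): low-precision parameters with
# stochastic rounding, full-precision momentum accumulator.
import numpy as np

def stochastic_round(x, delta):
    """Quantize x to the grid {k * delta} with stochastic (unbiased) rounding."""
    scaled = x / delta
    low = np.floor(scaled)
    prob_up = scaled - low  # probability of rounding up to the next grid point
    rounded = low + (np.random.rand(*np.shape(x)) < prob_up)
    return rounded * delta

def sghmc_step_low_precision(theta, momentum, grad_fn, eta=1e-3, alpha=0.1,
                             delta_w=2**-8):
    """One SGHMC update: theta lives on a grid of spacing delta_w,
    momentum is kept in full precision."""
    grad = grad_fn(theta)  # stochastic gradient of the negative log-density
    noise = np.sqrt(2 * alpha * eta) * np.random.randn(*theta.shape)
    momentum = (1 - alpha) * momentum - eta * grad + noise  # full-precision momentum
    theta = stochastic_round(theta + momentum, delta_w)     # quantized parameter update
    return theta, momentum

# Example usage: approximate samples from a standard Gaussian, whose
# negative log-density gradient is simply theta.
theta, momentum = np.zeros(2), np.zeros(2)
for _ in range(1000):
    theta, momentum = sghmc_step_low_precision(theta, momentum, grad_fn=lambda th: th)

The momentum buffer here plays the role the abstract attributes to SGHMC's robustness: quantization noise injected at each parameter update is averaged out by the momentum-based dynamics rather than entering the iterates directly, as it would in SGLD.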
