Skip to yearly menu bar Skip to main content


Stochastic Variance Reduction Methods for Saddle-Point Problems

Balamurugan Palaniappan · Francis Bach

Area 5+6+7+8 #88

Keywords: [ Large Scale Learning and Big Data ] [ Convex Optimization ] [ Stochastic Methods ]


We consider convex-concave saddle-point problems where the objective functions may be split in many components, and extend recent stochastic variance reduction methods (such as SVRG or SAGA) to provide the first large-scale linearly convergent algorithms for this class of problems which are common in machine learning. While the algorithmic extension is straightforward, it comes with challenges and opportunities: (a) the convex minimization analysis does not apply and we use the notion of monotone operators to prove convergence, showing in particular that the same algorithm applies to a larger class of problems, such as variational inequalities, (b) there are two notions of splits, in terms of functions, or in terms of partial derivatives, (c) the split does need to be done with convex-concave terms, (d) non-uniform sampling is key to an efficient algorithm, both in theory and practice, and (e) these incremental algorithms can be easily accelerated using a simple extension of the "catalyst" framework, leading to an algorithm which is always superior to accelerated batch algorithms.

Live content is unavailable. Log in and register to view live content