Skip to yearly menu bar Skip to main content


Poster

Linear and Kernel Classification in the Streaming Model: Improved Bounds for Heavy Hitters

Arvind Mahankali · David Woodruff

Keywords: [ Machine Learning ]


Abstract: We study linear and kernel classification in the streaming model. For linear classification, we improve upon the algorithm of (Tai, et al. 2018), which solves the 1 point query problem on the optimal weight vector wRd in sublinear space. We first give an algorithm solving the more difficult 2 point query problem on w, also in sublinear space. We also give an algorithm which solves the 2 heavy hitter problem on w, in sublinear space and running time. Finally, we give an algorithm which can deterministically solve the 1 point query problem on w, with sublinear space improving upon that of (Tai, et al. 2018). For kernel classification, if wRdp is the optimal weight vector classifying points in the stream according to their pth-degree polynomial kernel, then we give an algorithm solving the 2 point query problem on w in poly(plogdε) space, and an algorithm solving the 2 heavy hitter problem in poly(plogdε) space and running time. Note that our space and running time are polynomial in p, making our algorithms well-suited to high-degree polynomial kernels and the Gaussian kernel (approximated by the polynomial kernel of degree p=Θ(logT)).

Chat is not available.