Skip to yearly menu bar Skip to main content


Poster

Fast Distance Oracles for Any Symmetric Norm

Yichuan Deng · Zhao Song · OMRI WEINSTEIN · Ruizhe Zhang

Hall J (level 1) #820

Keywords: [ Distance oracle ] [ Symmetric norm ] [ Sketching ]


Abstract: In the \emph{Distance Oracle} problem, the goal is to preprocess n vectors x1,x2,,xn in a d-dimensional normed space (Xd,l) into a cheap data structure, so that given a query vector qXd, all distances qxil to the data points {xi}i[n] can be quickly approximated (faster than the trivial nd query time). This primitive is a basic subroutine in machine learning, data mining and similarity search applications. In the case of p norms, the problem is well understood, and optimal data structures are known for most values of p. Our main contribution is a fast (1±ε) distance oracle for \emph{any symmetric} norm l. This class includes p norms and Orlicz norms as special cases, as well as other norms used in practice, e.g. top-k norms, max-mixture and sum-mixture of p norms, small-support norms and the box-norm. We propose a novel data structure with O~(n(d+mmc(l)2)) preprocessing time and space, and tq=O~(d+nmmc(l)2) query time, where mmc(l) is a complexity-measure (modulus) of the symmetric norm under consideration. When l=p , this runtime matches the aforementioned state-of-art oracles.

Chat is not available.