Poster
Beyond Pairwise: Provably Fast Algorithms for Approximate k-Way Similarity Search
Anshumali Shrivastava · Ping Li
Harrah's Special Events Center, 2nd Floor
[
Abstract
]
Abstract:
We go beyond the notion of pairwise similarity and look into search problems with k-way similarity functions. In this paper, we focus on problems related to \emph{3-way Jaccard} similarity: R3way=|S1∩S2∩S3||S1∪S2∪S3|, S1,S2,S3∈C, where C is a size n collection of sets (or binary vectors). We show that approximate R3way similarity search problems admit fast algorithms with provable guarantees, analogous to the pairwise case. Our analysis and speedup guarantees naturally extend to k-way resemblance. In the process, we extend traditional framework of \emph{locality sensitive hashing (LSH)} to handle higher order similarities, which could be of independent theoretical interest. The applicability of R3way search is shown on the "Google sets" application. In addition, we demonstrate the advantage of R3way resemblance over the pairwise case in improving retrieval quality.
Live content is unavailable. Log in and register to view live content