`

Timezone: »

 
Poster
Practical Data-Dependent Metric Compression with Provable Guarantees
Piotr Indyk · Ilya Razenshteyn · Tal Wagner

Tue Dec 05 06:30 PM -- 10:30 PM (PST) @ Pacific Ballroom #24 #None

We introduce a new distance-preserving compact representation of multi-dimensional point-sets. Given n points in a d-dimensional space where each coordinate is represented using B bits (i.e., dB bits per point), it produces a representation of size O( d log(d B/epsilon) +log n) bits per point from which one can approximate the distances up to a factor of 1 + epsilon. Our algorithm almost matches the recent bound of Indyk et al, 2017} while being much simpler. We compare our algorithm to Product Quantization (PQ) (Jegou et al, 2011) a state of the art heuristic metric compression method. We evaluate both algorithms on several data sets: SIFT, MNIST, New York City taxi time series and a synthetic one-dimensional data set embedded in a high-dimensional space. Our algorithm produces representations that are comparable to or better than those produced by PQ, while having provable guarantees on its performance.

Author Information

Piotr Indyk (MIT)
Ilya Razenshteyn (Columbia University)
Tal Wagner (MIT)

More from the Same Authors

  • 2021 Poster: Few-Shot Data-Driven Algorithms for Low Rank Approximation »
    Piotr Indyk · Tal Wagner · David Woodruff
  • 2019 : Poster Session »
    Lili Yu · Aleksei Kroshnin · Alex Delalande · Andrew Carr · Anthony Tompkins · Aram-Alexandre Pooladian · Arnaud Robert · Ashok Makkuva · Aude Genevay · Bangjie Liu · Bo Zeng · Charlie Frogner · Elsa Cazelles · Esteban G Tabak · Fabio Ramos · François-Pierre PATY · Georgios Balikas · Giulio Trigila · Hao Wang · Hinrich Mahler · Jared Nielsen · karim Lounici · Kyle Swanson · Mukul Bhutani · Pierre Bréchet · Piotr Indyk · samuel cohen · Stefanie Jegelka · Tao Wu · Thibault Sejourne · Tudor Manole · Wenjun Zhao · Wenlin Wang · Wenqi Wang · Yonatan Dukler · Zihao Wang · Chaosheng Dong
  • 2019 : Poster Session »
    Jonathan Scarlett · Piotr Indyk · Ali Vakilian · Adrian Weller · Partha P Mitra · Benjamin Aubin · Bruno Loureiro · Florent Krzakala · Lenka Zdeborová · Kristina Monakhova · Joshua Yurtsever · Laura Waller · Hendrik Sommerhoff · Michael Moeller · Rushil Anirudh · Shuang Qiu · Xiaohan Wei · Zhuoran Yang · Jayaraman Thiagarajan · Salman Asif · Michael Gillhofer · Johannes Brandstetter · Sepp Hochreiter · Felix Petersen · Dhruv Patel · Assad Oberai · Akshay Kamath · Sushrut Karmalkar · Eric Price · Ali Ahmed · Zahra Kadkhodaie · Sreyas Mohan · Eero Simoncelli · Carlos Fernandez-Granda · Oscar Leong · Wesam Sakla · Rebecca Willett · Stephan Hoyer · Jascha Sohl-Dickstein · Sam Greydanus · Gauri Jagatap · Chinmay Hegde · Michael Kellman · Jon Tamir · Numan Laanait · Ousmane Dia · Mirco Ravanelli · Jonathan Binas · Negar Rostamzadeh · Shirin Jalali · Tiantian Fang · Alex Schwing · Sébastien Lachapelle · Philippe Brouillard · Tristan Deleu · Simon Lacoste-Julien · Stella Yu · Arya Mazumdar · Ankit Singh Rawat · Yue Zhao · Jianshu Chen · Rebecca Li · Hubert Ramsauer · Gabrio Rizzuti · Nikolaos Mitsakos · Dingzhou Cao · Thomas Strohmer · Yang Li · Pei Peng · Greg Ongie
  • 2019 : Learning-Based Low-Rank Approximations »
    Piotr Indyk
  • 2019 Poster: Estimating Entropy of Distributions in Constant Space »
    Jayadev None Acharya · Sourbh Bhadane · Piotr Indyk · Ziteng Sun
  • 2019 Poster: Learning-Based Low-Rank Approximations »
    Piotr Indyk · Ali Vakilian · Yang Yuan
  • 2019 Poster: Space and Time Efficient Kernel Density Estimation in High Dimensions »
    Arturs Backurs · Piotr Indyk · Tal Wagner
  • 2017 : Data-dependent methods for similarity search in high dimensions »
    Piotr Indyk
  • 2017 Poster: A graph-theoretic approach to multitasking »
    Noga Alon · Daniel Reichman · Igor Shinkar · Tal Wagner · Sebastian Musslick · Jonathan D Cohen · Tom Griffiths · Biswadip dey · Kayhan Ozcimder
  • 2017 Oral: A graph-theoretic approach to multitasking »
    Noga Alon · Daniel Reichman · Igor Shinkar · Tal Wagner · Sebastian Musslick · Jonathan D Cohen · Tom Griffiths · Biswadip dey · Kayhan Ozcimder
  • 2017 Poster: On the Fine-Grained Complexity of Empirical Risk Minimization: Kernel Methods and Neural Networks »
    Arturs Backurs · Piotr Indyk · Ludwig Schmidt
  • 2016 Poster: Fast recovery from a union of subspaces »
    Chinmay Hegde · Piotr Indyk · Ludwig Schmidt
  • 2015 Poster: Practical and Optimal LSH for Angular Distance »
    Alexandr Andoni · Piotr Indyk · Thijs Laarhoven · Ilya Razenshteyn · Ludwig Schmidt
  • 2014 Workshop: Optimal Transport and Machine Learning »
    Marco Cuturi · Gabriel Peyré · Justin Solomon · Alexander Barvinok · Piotr Indyk · Robert McCann · Adam Oberman