Timezone: »
We examine the Bayes-consistency of a recently proposed 1-nearest-neighbor-based multiclass learning algorithm. This algorithm is derived from sample compression bounds and enjoys the statistical advantages of tight, fully empirical generalization bounds, as well as the algorithmic advantages of a faster runtime and memory savings. We prove that this algorithm is strongly Bayes-consistent in metric spaces with finite doubling dimension --- the first consistency result for an efficient nearest-neighbor sample compression scheme. Rather surprisingly, we discover that this algorithm continues to be Bayes-consistent even in a certain infinite-dimensional setting, in which the basic measure-theoretic conditions on which classic consistency proofs hinge are violated. This is all the more surprising, since it is known that k-NN is not Bayes-consistent in this setting. We pose several challenging open problems for future research.
Author Information
Aryeh Kontorovich (Ben Gurion University)
Sivan Sabato (Ben Gurion University)
Roi Weiss (Weizmann institute of science)
More from the Same Authors
-
2021 Poster: Dimension-free empirical entropy estimation »
Doron Cohen · Aryeh Kontorovich · Aaron Koolyk · Geoffrey Wolfer -
2020 Poster: Learning discrete distributions with infinite support »
Doron Cohen · Aryeh Kontorovich · Geoffrey Wolfer -
2018 Poster: Learning convex polytopes with margin »
Lee-Ad Gottlieb · Eran Kaufman · Aryeh Kontorovich · Gabriel Nivasch -
2016 Poster: Active Nearest-Neighbor Learning in Metric Spaces »
Aryeh Kontorovich · Sivan Sabato · Ruth Urner -
2015 Poster: Mixing Time Estimation in Reversible Markov Chains from a Single Sample Path »
Daniel Hsu · Aryeh Kontorovich · Csaba Szepesvari -
2014 Poster: Active Regression by Stratification »
Sivan Sabato · Remi Munos -
2014 Poster: Near-optimal sample compression for nearest neighbors »
Lee-Ad Gottlieb · Aryeh Kontorovich · Pinhas Nisnevitch -
2014 Poster: Consistency of weighted majority votes »
Daniel Berend · Aryeh Kontorovich -
2013 Poster: Predictive PAC Learning and Process Decompositions »
Cosma Shalizi · Aryeh Kontorovich -
2013 Poster: Auditing: Active Learning with Outcome-Dependent Query Costs »
Sivan Sabato · Anand D Sarwate · Nati Srebro -
2012 Poster: Multiclass Learning Approaches: A Theoretical Comparison with Implications »
Amit Daniely · Sivan Sabato · Shai Shalev-Shwartz -
2012 Spotlight: Multiclass Learning Approaches: A Theoretical Comparison with Implications »
Amit Daniely · Sivan Sabato · Shai Shalev-Shwartz -
2010 Poster: Tight Sample Complexity of Large-Margin Learning »
Sivan Sabato · Nati Srebro · Naftali Tishby