Poster
Hypothesis Selection with Memory Constraints
Maryam Aliakbarpour · Mark Bun · Adam Smith
Great Hall & Hall B1+B2 (level 1) #2027
Abstract:
Hypothesis selection is a fundamental problem in learning theory and statistics. Given a dataset and a finite set of candidate distributions, the goal is to select a distribution that matches the data as well as possible. More specifically, suppose we have sample access to an unknown distribution $P$ over a domain $\mathcal{X}$ that we know is well-approximated by one of a class of $n$ distributions (a.k.a. hypotheses), $\mathcal{H} = \{H_1, H_2, \ldots, H_n\}$. The goal is to design an algorithm that outputs a distribution $\hat{H} \in \mathcal{H}$ whose total variation distance from $P$ is nearly minimal.

In this work, we study the hypothesis selection problem under memory constraints. We consider a model where samples from $P$ are presented in a stream and we access each sample $x$ via "PDF-comparison" queries that allow us to compare the probability densities of any pair of hypotheses at the domain point $x$ (i.e., is $H_i(x) < H_j(x)$?). This model allows us to study how much memory is needed at any point in time to store information about the portion of the stream seen so far.

Our main result is an algorithm that achieves a nearly optimal tradeoff between memory usage and the number of samples required. In particular, given $b$ bits of memory (for $b$ roughly between $\log n$ and $n$), our algorithm solves the hypothesis selection problem with $s$ samples, where $b \cdot s = O(n \log n)$. This result is optimal up to an $O(\log n)$ factor, for all $b$.
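To make the PDF-comparison query model concrete, here is a minimal sketch of the classical Scheffé tournament, a standard baseline for hypothesis selection (not the memory-efficient streaming algorithm of this paper), written so that every sample is touched only through a `pdf_compare` query. The finite domain, the Dirichlet-generated hypotheses, and the sample count are illustrative assumptions.

```python
# Illustrative sketch: Scheffe tournament for hypothesis selection,
# with samples accessed only via PDF-comparison queries.
import numpy as np

rng = np.random.default_rng(0)

def pdf_compare(H, i, j, x):
    """The model's only access to a sample x: is H_i(x) < H_j(x)?"""
    return H[i][x] < H[j][x]

def scheffe_winner(H, i, j, samples):
    """Scheffe test between hypotheses i and j.
    Over the Scheffe set S = {x : H_i(x) > H_j(x)}, each candidate
    predicts a probability mass; the one whose prediction is closer
    to the empirical mass of S wins."""
    S = H[i] > H[j]  # Scheffe set; computable since the hypotheses are known
    # Empirical mass of S, estimated using only PDF-comparison queries:
    # x is in S exactly when H_j(x) < H_i(x).
    emp = np.mean([pdf_compare(H, j, i, x) for x in samples])
    mass_i, mass_j = H[i][S].sum(), H[j][S].sum()
    return i if abs(mass_i - emp) <= abs(mass_j - emp) else j

def select_hypothesis(H, samples):
    """Round-robin tournament: output the hypothesis with the most wins.
    Storing all n win counts (and reusing all samples) places this baseline
    at the memory-heavy extreme of the tradeoff studied in the paper."""
    n = len(H)
    wins = np.zeros(n, dtype=int)
    for i in range(n):
        for j in range(i + 1, n):
            wins[scheffe_winner(H, i, j, samples)] += 1
    return int(np.argmax(wins))

# Toy usage: domain of size 10, three candidates, P close to H_1.
m = 10
H = [rng.dirichlet(np.ones(m)) for _ in range(3)]
P = 0.9 * H[1] + 0.1 * rng.dirichlet(np.ones(m))
samples = rng.choice(m, size=2000, p=P)
print("selected hypothesis:", select_hypothesis(H, samples))
```

This baseline uses the stream only through comparisons of hypothesis densities at sample points, which is exactly the access pattern the abstract's model restricts; the paper's contribution is achieving the same task with far less working memory at the cost of more samples.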