Skip to yearly menu bar Skip to main content


Poster

The Power of Adaptivity in Identifying Statistical Alternatives

Kevin Jamieson · Daniel Haas · Benjamin Recht

Area 5+6+7+8 #56

Keywords: [ Bandit Algorithms ] [ Online Learning ] [ Active Learning ]


Abstract: This paper studies the trade-off between two different kinds of pure exploration: breadth versus depth. We focus on the most biased coin problem, asking how many total coin flips are required to identify a heavy'' coin from an infinite bag containing both heavy'' coins with mean θ1(0,1)θ1(0,1), and light" coins with mean θ0(0,θ1)θ0(0,θ1), where heavy coins are drawn from the bag with proportion α(0,1/2)α(0,1/2). When α,θ0,θ1α,θ0,θ1 are unknown, the key difficulty of this problem lies in distinguishing whether the two kinds of coins have very similar means, or whether heavy coins are just extremely rare. While existing solutions to this problem require some prior knowledge of the parameters θ0,θ1,αθ0,θ1,α, we propose an adaptive algorithm that requires no such knowledge yet still obtains near-optimal sample complexity guarantees. In contrast, we provide a lower bound showing that non-adaptive strategies require at least quadratically more samples. In characterizing this gap between adaptive and nonadaptive strategies, we make connections to anomaly detection and prove lower bounds on the sample complexity of differentiating between a single parametric distribution and a mixture of two such distributions.

Live content is unavailable. Log in and register to view live content