Poster
Sequential Algorithms for Testing Closeness of Distributions
Aadil Oufkir · Omar Fawzi · Nicolas Flammarion · Aurélien Garivier
Keywords: [ Theory ]
Abstract:
What advantage do sequential procedures provide over batch algorithms for testing properties of unknown distributions? Focusing on the problem of testing whether two distributions D1D1 and D2D2 on {1,…,n}{1,…,n} are equal or ϵϵ-far, we give several answers to this question. We show that for a small alphabet size nn, there is a sequential algorithm that outperforms any batch algorithm by a factor of at least 44 in terms sample complexity. For a general alphabet size nn, we give a sequential algorithm that uses no more samples than its batch counterpart, and possibly fewer if the actual distance between D1D1 and D2D2 is larger than ϵϵ. As a corollary, letting ϵϵ go to 00, we obtain a sequential algorithm for testing closeness (with no a priori bound on the distance between D1D1 and D2D2) with a sample complexity ˜O(n2/3TV(D1,D2)4/3)~O(n2/3TV(D1,D2)4/3): this improves over the ˜O(n/lognTV(D1,D2)2)~O(n/lognTV(D1,D2)2) tester of [Daskalakis and Kawase 2017] and is optimal up to multiplicative constants. We also establish limitations of sequential algorithms for the problem of testing closeness: they can improve the worst case number of samples by at most a constant factor.
Chat is not available.