Poster
Federated Linear Bandits with Finite Adversarial Actions
Li Fan · Ruida Zhou · Chao Tian · Cong Shen
Great Hall & Hall B1+B2 (level 1) #1904
Abstract:
We study a federated linear bandits model, where clients communicate with a central server to solve a linear contextual bandits problem with finite adversarial action sets that may be different across clients. To address the unique challenges of **adversarial finite** action sets, we propose the FedSupLinUCB algorithm, which extends the principles of SupLinUCB and OFUL algorithms in linear contextual bandits. We prove that FedSupLinUCB achieves a total regret of , where is the total number of arm pulls from all clients, and is the ambient dimension of the linear model. This matches the minimax lower bound and thus is order-optimal (up to polylog terms). We study both asynchronous and synchronous cases and show that the communication cost can be controlled as and , respectively. The FedSupLinUCB design is further extended to two scenarios: (1) variance-adaptive, where a total regret of can be achieved with being the noise variance of round ; and (2) adversarial corruption, where a total regret of can be achieved with being the total corruption budget. Experiment results corroborate the theoretical analysis and demonstrate the effectiveness of \alg on both synthetic and real-world datasets.
Chat is not available.