Poster
Cascading Contextual Assortment Bandits
Hyun-jun Choi · Rajan Udwani · Min-hwan Oh
Great Hall & Hall B1+B2 (level 1) #1812
Abstract:
We present a new combinatorial bandit model, the \textit{cascading contextual assortment bandit}. This model serves as a generalization of both existing cascading bandits and assortment bandits, broadening their applicability in practice. For this model, we propose our first UCB bandit algorithm, UCB-CCA. We prove that this algorithm achieves a -step regret upper-bound of , sharper than existing bounds for cascading contextual bandits by eliminating dependence on cascade length . To improve the dependence on problem-dependent constant , we introduce our second algorithm, UCB-CCA+, which leverages a new Bernstein-type concentration result. This algorithm achieves without dependence on in the leading term. We substantiate our theoretical claims with numerical experiments, demonstrating the practical efficacy of our proposed methods.
Chat is not available.