We consider the continuum-armed bandit problem in a novel setting: recommending the best arms within a fixed budget under aggregated feedback.
This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available.
We constrain the set of reward functions by assuming that they are from a Gaussian Process and propose the Gaussian Process Optimistic Optimisation (GPOO) algorithm.
We adaptively construct a tree with nodes as subsets of the arm space, where the feedback is the aggregated reward of representatives of a node.
We propose a new simple regret notion with respect to aggregated feedback on the recommended arms.
We provide a theoretical analysis of the proposed algorithm and recover single-point feedback as a special case.
We illustrate GPOO and compare it with related algorithms on simulated data.
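To make the notion of aggregated feedback on a tree of arm subsets concrete, the following is a minimal sketch under stated assumptions, not the GPOO algorithm itself. It assumes a one-dimensional arm space [0, 1], evenly spaced representatives per node, binary splits, and a plain greedy expansion rule in place of GPOO's Gaussian Process based optimistic rule. All function names (`aggregated_feedback`, `split`, `greedy_tree_search`) are hypothetical and chosen for illustration only.

```python
import numpy as np


def aggregated_feedback(f, lo, hi, n_reps=4, noise_std=0.0, rng=None):
    """Average reward of f over n_reps representatives of the node [lo, hi].

    Assumption: representatives are the midpoints of n_reps equal sub-cells.
    """
    rng = np.random.default_rng() if rng is None else rng
    edges = np.linspace(lo, hi, n_reps + 1)
    reps = 0.5 * (edges[:-1] + edges[1:])
    value = float(np.mean(f(reps)))
    return value + noise_std * rng.standard_normal()


def split(lo, hi, k=2):
    """Partition the node [lo, hi] into k equal-width child nodes."""
    edges = np.linspace(lo, hi, k + 1)
    return [(float(a), float(b)) for a, b in zip(edges[:-1], edges[1:])]


def greedy_tree_search(f, budget, lo=0.0, hi=1.0):
    """Adaptively grow a tree over [lo, hi] using only aggregated feedback.

    Assumption: the leaf with the highest observed aggregated reward is
    expanded next. This greedy rule is a stand-in for an optimistic
    (upper-confidence) rule and carries no theoretical guarantee.
    Each node evaluation costs one unit of budget.
    """
    leaves = [(aggregated_feedback(f, lo, hi), lo, hi)]
    used = 1
    while used + 2 <= budget:
        leaves.sort(key=lambda leaf: leaf[0])
        _, a, b = leaves.pop()  # best leaf so far
        for c, d in split(a, b):
            leaves.append((aggregated_feedback(f, c, d), c, d))
            used += 1
    # Recommend the node (subset of arms) with the best aggregated reward.
    return max(leaves, key=lambda leaf: leaf[0])


if __name__ == "__main__":
    reward = lambda x: -(x - 0.3) ** 2  # toy reward, maximised at x = 0.3
    value, a, b = greedy_tree_search(reward, budget=21)
    print(f"recommended node: [{a:.4f}, {b:.4f}], aggregated reward {value:.2e}")
```

With `n_reps=1` each node is summarised by a single point, which mirrors the single-point feedback special case mentioned above.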
Author Information
Mengyan Zhang (Australian National University)
Russell Tsuchida (CSIRO)
Cheng Soon Ong (Data61 and Australian National University)
Cheng Soon Ong is a principal research scientist at the Machine Learning Research Group, Data61, CSIRO, and is the director of the machine learning and artificial intelligence future science platform at CSIRO. He is also an adjunct associate professor at the Australian National University. He is interested in enabling scientific discovery by extending statistical machine learning methods.
More from the Same Authors
- 2021 : Factorized Fourier Neural Operators
  Alasdair Tran · Alexander Mathews · Lexing Xie · Cheng Soon Ong
- 2022 : Detecting structured signals in radio telescope data using RKHS
  Russell Tsuchida · Suk Yee Yong
- 2022 : When are equilibrium networks scoring algorithms?
  Russell Tsuchida · Cheng Soon Ong
- 2020 Tutorial: (Track1) There and Back Again: A Tale of Slopes and Expectations
  Marc Deisenroth · Cheng Soon Ong
- 2019 Poster: Disentangled behavioural representations
  Amir Dezfouli · Hassan Ashtiani · Omar Ghattas · Richard Nock · Peter Dayan · Cheng Soon Ong
- 2018 Poster: Representation Learning of Compositional Data
  Marta Avalos · Richard Nock · Cheng Soon Ong · Julien Rouar · Ke Sun
- 2016 Poster: A scaled Bregman theorem with applications
  Richard Nock · Aditya Menon · Cheng Soon Ong
- 2013 Workshop: Machine Learning Open Source Software: Towards Open Workflows
  Antti Honkela · Cheng Soon Ong
- 2011 Poster: Contextual Gaussian Process Bandit Optimization
  Andreas Krause · Cheng Soon Ong
- 2010 Workshop: New Directions in Multiple Kernel Learning
  Marius Kloft · Ulrich Rueckert · Cheng Soon Ong · Alain Rakotomamonjy · Soeren Sonnenburg · Francis Bach
- 2010 Demonstration: mldata.org - machine learning data and benchmark
  Cheng Soon Ong
- 2008 Workshop: Machine Learning Open Source Software
  Soeren Sonnenburg · Mikio L Braun · Cheng Soon Ong