firstbacksecondback
Filter by Keyword:
61 Results
Poster
|
Tue 17:30 |
Offline Contextual Bayesian Optimization Ian Char · Youngseog Chung · Willie Neiswanger · Kirthevasan Kandasamy · Oak Nelson · Mark Boyer · Egemen Kolemen · Jeff Schneider |
|
Poster
|
Thu 10:45 |
Thompson Sampling for Multinomial Logit Contextual Bandits Min-hwan Oh · Garud Iyengar |
|
Poster
|
Thu 10:45 |
Categorized Bandits Matthieu Jedor · Vianney Perchet · Jonathan Louedec |
|
Poster
|
Wed 17:00 |
Nonparametric Contextual Bandits in Metric Spaces with Unknown Metric Nirandika Wanigasekara · Christina Yu |
|
Poster
|
Tue 10:45 |
Provably Efficient Q-Learning with Low Switching Cost Yu Bai · Tengyang Xie · Nan Jiang · Yu-Xiang Wang |
|
Poster
|
Thu 10:45 |
Stochastic Bandits with Context Distributions Johannes Kirschner · Andreas Krause |
|
Poster
|
Tue 17:30 |
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems Young H Jung · Ambuj Tewari |
|
Poster
|
Wed 17:00 |
Surrogate Objectives for Batch Policy Optimization in One-step Decision Making Minmin Chen · Ramki Gummadi · Chris Harris · Dale Schuurmans |
|
Poster
|
Tue 17:30 |
Online EXP3 Learning in Adversarial Bandits with Delayed Feedback Ilai Bistritz · Zhengyuan Zhou · Xi Chen · Nicholas Bambos · Jose Blanchet |
|
Poster
|
Tue 17:30 |
Oracle-Efficient Algorithms for Online Linear Optimization with Bandit Feedback Shinji Ito · Daisuke Hatano · Hanna Sumita · Kei Takemura · Takuro Fukunaga · Naonori Kakimura · Ken-Ichi Kawarabayashi |
|
Poster
|
Tue 10:45 |
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs Max Simchowitz · Kevin Jamieson |
|
Poster
|
Wed 17:00 |
Batched Multi-armed Bandits Problem Zijun Gao · Yanjun Han · Zhimei Ren · Zhengqing Zhou |