36 Results
Poster
|
Tue 8:45 |
Robust Learning for Smoothed Online Convex Optimization with Feedback Delay Pengfei Li · Jianyi Yang · Adam Wierman · Shaolei Ren |
|
Poster
|
Tue 15:15 |
Double Auctions with Two-sided Bandit Feedback Soumya Basu · Abishek Sankararaman |
|
Poster
|
Wed 15:00 |
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation Nikki Lijing Kuang · Ming Yin · Mengdi Wang · Yu-Xiang Wang · Yian Ma |
|
Poster
|
Tue 15:15 |
Nearest Neighbour with Bandit Feedback Stephen Pasteris · Chris Hicks · Vasilios Mavroudis |
|
Poster
|
Tue 15:15 |
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback Yang Cai · Haipeng Luo · Chen-Yu Wei · Weiqiang Zheng |
|
Poster
|
Tue 8:45 |
On the Minimax Regret for Online Learning with Feedback Graphs Khaled Eldowa · Emmanuel Esposito · Tom Cesari · Nicolò Cesa-Bianchi |
|
Workshop
|
Online Learning of Optimal Prescriptions under Bandit Feedback with Unknown Contexts Hongju Park · Mohamad Kazem Shirani Faradonbeh |
|
Poster
|
Thu 8:45 |
Continual Learning for Instruction Following from Realtime Feedback Alane Suhr · Yoav Artzi |
|
Poster
|
Tue 8:45 |
Practical Contextual Bandits with Feedback Graphs Mengxiao Zhang · Yuheng Zhang · Olga Vrousgou · Haipeng Luo · Paul Mineiro |
|
Poster
|
Tue 15:15 |
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback Yunchang Yang · Han Zhong · Tianhao Wu · Bin Liu · Liwei Wang · Simon Du |
|
Poster
|
Thu 15:00 |
Exploiting Correlated Auxiliary Feedback in Parameterized Bandits Arun Verma · Zhongxiang Dai · Yao Shu · Bryan Kian Hsiang Low |
|
Poster
|
Tue 15:15 |
Imitation Learning from Vague Feedback Xin-Qiang Cai · Yu-Jie Zhang · Chao-Kai Chiang · Masashi Sugiyama |