firstbacksecondback
123 Results
Poster
|
Thu 9:00 |
Minimax Regret for Cascading Bandits Daniel Vial · Sujay Sanghavi · Sanjay Shakkottai · R. Srikant |
|
Poster
|
Wed 14:00 |
Toward Understanding Privileged Features Distillation in Learning-to-Rank Shuo Yang · Sujay Sanghavi · Holakou Rahmanian · Jan Bakus · Vishwanathan S. V. N. |
|
Poster
|
Thu 14:00 |
Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems Yue Kang · Cho-Jui Hsieh · Thomas Chun Man Lee |
|
Poster
|
Tue 14:00 |
Differentially Private Online-to-batch for Smooth Losses Qinzi Zhang · Hoang Tran · Ashok Cutkosky |
|
Workshop
|
Uncertainty-Driven Pessimistic Q-Ensemble for Offline-to-Online Reinforcement Learning Ingook Jang · Seonghyun Kim |
||
Poster
|
Tue 14:00 |
Parameter-free Regret in High Probability with Heavy Tails Jiujia Zhang · Ashok Cutkosky |
|
Poster
|
Trading Off Resource Budgets For Improved Regret Bounds Thomas Orton · Damon Falck |
||
Poster
|
Wed 14:00 |
An α-regret analysis of Adversarial Bilateral Trade Yossi Azar · Amos Fiat · Federico Fusco |
|
Poster
|
Tue 14:00 |
Diversified Recommendations for Agents with Adaptive Preferences William Brown · Arpit Agarwal |
|
Poster
|
Thu 14:00 |
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos Bowen Baker · Ilge Akkaya · Peter Zhokov · Joost Huizinga · Jie Tang · Adrien Ecoffet · Brandon Houghton · Raul Sampedro · Jeff Clune |
|
Workshop
|
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization Runlong Zhou · Yuandong Tian · YI WU · Simon Du |
||
Workshop
|
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion Utkarsh Soni · Sarath Sreedharan · Mudit Verma · Lin Guan · Matthew Marquez · Subbarao Kambhampati |