Search All 2022 Events
 

123 Results

Poster
Thu 9:00 Minimax Regret for Cascading Bandits
Daniel Vial · Sujay Sanghavi · Sanjay Shakkottai · R. Srikant
Poster
Wed 14:00 Toward Understanding Privileged Features Distillation in Learning-to-Rank
Shuo Yang · Sujay Sanghavi · Holakou Rahmanian · Jan Bakus · Vishwanathan S. V. N.
Poster
Thu 14:00 Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems
Yue Kang · Cho-Jui Hsieh · Thomas Chun Man Lee
Poster
Tue 14:00 Differentially Private Online-to-batch for Smooth Losses
Qinzi Zhang · Hoang Tran · Ashok Cutkosky
Workshop
Uncertainty-Driven Pessimistic Q-Ensemble for Offline-to-Online Reinforcement Learning
Ingook Jang · Seonghyun Kim
Poster
Tue 14:00 Parameter-free Regret in High Probability with Heavy Tails
Jiujia Zhang · Ashok Cutkosky
Poster
Trading Off Resource Budgets For Improved Regret Bounds
Thomas Orton · Damon Falck
Poster
Wed 14:00 An α-regret analysis of Adversarial Bilateral Trade
Yossi Azar · Amos Fiat · Federico Fusco
Poster
Tue 14:00 Diversified Recommendations for Agents with Adaptive Preferences
William Brown · Arpit Agarwal
Poster
Thu 14:00 Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker · Ilge Akkaya · Peter Zhokov · Joost Huizinga · Jie Tang · Adrien Ecoffet · Brandon Houghton · Raul Sampedro · Jeff Clune
Workshop
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization
Runlong Zhou · Yuandong Tian · Yi Wu · Simon Du
Workshop
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion
Utkarsh Soni · Sarath Sreedharan · Mudit Verma · Lin Guan · Matthew Marquez · Subbarao Kambhampati