firstbacksecondback
157 Results
Poster
|
Tue 8:45 |
Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms Tiancheng Jin · Junyan Liu · Haipeng Luo |
|
Workshop
|
Non-adaptive Online Finetuning for Offline Reinforcement Learning Audrey Huang · Mohammad Ghavamzadeh · Nan Jiang · Marek Petrik |
||
Poster
|
Tue 8:45 |
Fast Asymptotically Optimal Algorithms for Non-Parametric Stochastic Bandits Dorian Baudry · Fabien Pesquerel · Rémy Degenne · Odalric-Ambrym Maillard |
|
Poster
|
Tue 15:15 |
Dynamic Regret of Adversarial Linear Mixture MDPs Long-Fei Li · Peng Zhao · Zhi-Hua Zhou |
|
Poster
|
Wed 8:45 |
Online Learning under Adversarial Nonlinear Constraints Pavel Kolev · Georg Martius · Michael Muehlebach |
|
Poster
|
Wed 8:45 |
Multi-Step Generalized Policy Improvement by Leveraging Approximate Models Lucas N. Alegre · Ana Bazzan · Ann Nowe · Bruno C. da Silva |
|
Workshop
|
Stochastic linear dynamics in parameters to deal with Neural Networks plasticity loss Alexandre Galashov · Michalis Titsias · Razvan Pascanu · Yee Whye Teh · Maneesh Sahani |
||
Poster
|
Thu 8:45 |
Adaptive Online Replanning with Diffusion Models Siyuan Zhou · Yilun Du · Shun Zhang · Mengdi Xu · Yikang Shen · Wei Xiao · Dit-Yan Yeung · Chuang Gan |
|
Poster
|
Wed 15:00 |
No-regret Algorithms for Fair Resource Allocation Abhishek Sinha · Ativ Joshi · Rajarshi Bhattacharjee · Cameron Musco · Mohammad Hajiesmaili |
|
Poster
|
Thu 15:00 |
Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference Licong Lin · Mufang Ying · Suvrojit Ghosh · Koulik Khamaru · Cun-Hui Zhang |
|
Poster
|
Wed 15:00 |
Maximum Average Randomly Sampled: A Scale Free and Non-parametric Algorithm for Stochastic Bandits Masoud Moravej Khorasani · Erik Weyer |
|
Poster
|
Wed 8:45 |
ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning Mingyu Xu · Zheng Lian · Lei Feng · Bin Liu · Jianhua Tao |