firstbacksecondback
157 Results
Poster
|
Thu 8:45 |
Meta-Learning Adversarial Bandit Algorithms Misha Khodak · Ilya Osadchiy · Keegan Harris · Maria-Florina Balcan · Kfir Y. Levy · Ron Meir · Steven Wu |
|
Poster
|
Tue 8:45 |
Smoothed Analysis of Sequential Probability Assignment Alankrita Bhatt · Nika Haghtalab · Abhishek Shetty |
|
Poster
|
Tue 8:45 |
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization Hai Zhang · Hang Yu · Junqiao Zhao · Di Zhang · xiao zhang · Hongtu Zhou · Chang Huang · Chen Ye |
|
Poster
|
Thu 15:00 |
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Ziyi Huang · Henry Lam · Amirhossein Meisami · Haofeng Zhang |
|
Poster
|
Wed 8:45 |
Nash Regret Guarantees for Linear Bandits Ayush Sawarni · Ayush Sawarni · Soumyabrata Pal · Siddharth Barman |
|
Poster
|
Wed 8:45 |
Asymptotically Optimal Quantile Pure Exploration for Infinite-Armed Bandits Evelyn Xiao-Yue Gong · Mark Sellke |
|
Poster
|
Tue 8:45 |
High-dimensional Contextual Bandit Problem without Sparsity Junpei Komiyama · Masaaki Imaizumi |
|
Poster
|
Thu 15:00 |
Covariance-adaptive best arm identification El Mehdi Saad · Gilles Blanchard · Nicolas Verzelen |
|
Poster
|
Wed 15:00 |
Attacks on Online Learners: a Teacher-Student Analysis Riccardo Giuseppe Margiotta · Sebastian Goldt · Guido Sanguinetti |
|
Poster
|
Tue 8:45 |
Adversarial Attacks on Online Learning to Rank with Click Feedback Jinhang Zuo · Zhiyao Zhang · Zhiyong Wang · Shuai Li · Mohammad Hajiesmaili · Adam Wierman |
|
Workshop
|
Risk Aversion of Online Learning Algorithms Andreas Haupt · Aroon Narayanan |
||
Workshop
|
Bilevel Optimization to Learn Training Distributions for Language Modeling under Domain Shift David Grangier · Pierre Ablin · Awni Hannun |