157 Results
Poster
|
Tue 8:45 |
Online Convex Optimization with Unbounded Memory Raunak Kumar · Sarah Dean · Robert Kleinberg |
|
Workshop
|
Paper 4: Beyond Hallucination: Building a Reliable Question Answering & Explanation System with GPTs Kazem Jahanbakhsh · Hajiabadi · Vipul Gagrani · Jennifer Louie · Saurabh Khanwalkar |
|
Poster
|
Wed 8:45 |
Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games Zihan Zhu · Ethan Fang · Zhuoran Yang |
|
Poster
|
Wed 15:00 |
Regret Minimization via Saddle Point Optimization Johannes Kirschner · Alireza Bakhtiari · Kushagra Chandak · Volodymyr Tkachuk · Csaba Szepesvari |
|
Poster
|
Tue 15:15 |
Adjustable Robust Reinforcement Learning for Online 3D Bin Packing Yuxin Pan · Yize Chen · Fangzhen Lin |
|
Poster
|
Wed 15:00 |
An ε-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond Marc Jourdan · Rémy Degenne · Emilie Kaufmann |
|
Poster
|
Wed 8:45 |
When Can We Track Significant Preference Shifts in Dueling Bandits? Joe Suk · Arpit Agarwal |
|
Poster
|
Thu 8:45 |
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts Chaoqi Wang · Ziyu Ye · Zhe Feng · Ashwinkumar Badanidiyuru Varadaraja · Haifeng Xu |
|
Poster
|
Wed 8:45 |
Reward Imputation with Sketching for Contextual Batched Bandits Xiao Zhang · Ninglu Shao · Zihua Si · Jun Xu · Wenhan Wang · Hanjing Su · Ji-Rong Wen |
|
Poster
|
Tue 15:15 |
Efficient Online Clustering with Moving Costs Dimitrios Christou · Stratis Skoulakis · Volkan Cevher |
|
Poster
|
Wed 8:45 |
Finite-Time Logarithmic Bayes Regret Upper Bounds Alexia Atsidakou · Branislav Kveton · Sumeet Katariya · Constantine Caramanis · Sujay Sanghavi |
|
Poster
|
Wed 8:45 |
Cascading Bandits: Optimizing Recommendation Frequency in Delayed Feedback Environments Dairui Wang · Junyu Cao · Yan Zhang · Wei Qi |