firstbacksecondback
122 Results
Workshop
|
Sun 16:00 |
Online Learning Guided Quasi-Newton Methods: Improved Global Non-asymptotic Guarantees, Aryan Mokhtari Aryan Mokhtari |
|
Poster
|
Wed 16:30 |
An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints Jung-hun Kim · Milan Vojnovic · Se-Young Yun |
|
Poster
|
Thu 11:00 |
Local and Adaptive Mirror Descents in Extensive-Form Games Côme Fiegel · Pierre Ménard · Tadashi Kozuno · Remi Munos · Vianney Perchet · Michal Valko |
|
Poster
|
Wed 16:30 |
Online Non-convex Learning in Dynamic Environments Zhipan Xu · Lijun Zhang |
|
Poster
|
Wed 11:00 |
No-Regret Learning for Fair Multi-Agent Social Welfare Optimization Mengxiao Zhang · Ramiro Deo-Campo Vuong · Haipeng Luo |
|
Workshop
|
Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training Anchit Jain · Rozhin Nobahari · Aristide Baratin · Stefano Sarao Mannelli |
||
Poster
|
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Qin-Wen Luo · Ming-Kun Xie · Yewen Wang · Sheng-Jun Huang |
||
Poster
|
Thu 16:30 |
Bandits with Ranking Feedback Davide Maran · Francesco Bacchiocchi · Francesco Emanuele Stradi · Matteo Castiglioni · Nicola Gatti · Marcello Restelli |
|
Poster
|
Thu 11:00 |
Optimal Multi-Fidelity Best-Arm Identification Riccardo Poiani · Rémy Degenne · Emilie Kaufmann · Alberto Maria Metelli · Marcello Restelli |
|
Workshop
|
ABEL: Sample Efficient Online Reinforcement Learning for Neural Theorem Proving Fabian Gloeckle · Jannis Limperg · Gabriel Synnaeve · Amaury Hayat |
||
Poster
|
Thu 16:30 |
Mixture of Experts Meets Prompt-Based Continual Learning Minh Le · An Nguyen The · Huy Nguyen · Trang Nguyen · Trang Pham · Linh Ngo · Nhat Ho |
|
Workshop
|
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Yihe Deng · Paul Mineiro |