firstbacksecondback
496 Results
Poster
|
Wed 11:00 |
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL Qi Lv · Xiang Deng · Gongwei Chen · MICHAEL YU WANG · Liqiang Nie |
|
Poster
|
Fri 16:30 |
Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance Josh McClellan · Naveed Haghani · John Winder · Furong Huang · Pratap Tokekar |
|
Poster
|
Wed 16:30 |
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning Marvin Alles · Philip Becker-Ehmck · Patrick van der Smagt · Maximilian Karl |
|
Poster
|
Wed 16:30 |
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning Alessandro Montenegro · Marco Mussi · Matteo Papini · Alberto Maria Metelli |
|
Poster
|
Wed 11:00 |
Sub-optimal Experts mitigate Ambiguity in Inverse Reinforcement Learning Riccardo Poiani · Curti Gabriele · Alberto Maria Metelli · Marcello Restelli |
|
Workshop
|
Safe Reinforcement Learning for Remote Microgrid Optimization with Industrial Constraints Hadi Nekoei · Alexandre Blondin Massé · Rachid Hassani · Sarath Chandar · Vincent Mai |
||
Poster
|
Thu 16:30 |
Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication Huao Li · Hossein Nourkhiz Mahjoub · Behdad Chalaki · Vaishnav Tadiparthi · Kwonjoon Lee · Ehsan Moradi Pari · Charles Lewis · Katia Sycara |
|
Workshop
|
ABEL: Sample Efficient Online Reinforcement Learning for Neural Theorem Proving Fabian Gloeckle · Jannis Limperg · Gabriel Synnaeve · Amaury Hayat |
||
Workshop
|
Imitation Guided Automated Red Teaming Sajad Mousavi · Desik Rengarajan · Ashwin Ramesh Babu · Vineet Gundecha · Antonio Guillen-Perez · Ricardo Luna Gutierrez · Avisek Naug · Sahand Ghorbanpour · Soumyendu Sarkar |
||
Poster
|
Wed 11:00 |
Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk · Youssef Mroueh · Brian Belgodere · Mattia Rigotti · Apoorva Nitsure · Mikhail Yurochkin · Kristjan Greenewald · Jiri Navratil · Jarret Ross |
|
Poster
|
Fri 16:30 |
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs Long-Fei Li · Peng Zhao · Zhi-Hua Zhou |
|
Workshop
|
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks Thomas Schmied · Thomas Adler · Vihang Patil · Maximilian Beck · Korbinian Pöppel · Johannes Brandstetter · Günter Klambauer · Razvan Pascanu · Sepp Hochreiter |