Processing math: 100%
Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

553 Results

<<   <   Page 46 of 47   >   >>
Workshop
VinePPO: Accurate Credit Assignment in RL for LLM Mathematical Reasoning
Amirhossein Kazemnejad · Milad Aghajohari · Eva Portelance · Alessandro Sordoni · Siva Reddy · Aaron Courville · Nicolas Le Roux
Poster
Wed 11:00 SustainDC: Benchmarking for Sustainable Data Center Control
Avisek Naug · Antonio Guillen-Perez · Ricardo Luna Gutierrez · Vineet Gundecha · Cullen Bash · Sahand Ghorbanpour · Sajad Mousavi · Ashwin Ramesh Babu · Dejan Markovikj · Lekhapriya Dheeraj Kashyap · Desik Rengarajan · Soumyendu Sarkar
Poster
Thu 11:00 Reinforcement Learning with LTL and ω-Regular Objectives via Optimality-Preserving Translation to Average Rewards
Xuan Bach Le · Dominik Wagner · Leon Witzman · Alexander Rabinovich · Luke Ong
Workshop
Crystal Design Amidst Noisy DFT Signals: A Reinforcement Learning Approach
Prashant Govindarajan · Mathieu Reymond · Santiago Miret · Mariano Phielipp · Sarath Chandar
Poster
Wed 11:00 Offline Multitask Representation Learning for Reinforcement Learning
Haque Ishfaq · Thanh Nguyen-Tang · Songtao Feng · Raman Arora · Mengdi Wang · Ming Yin · Doina Precup
Workshop
ENHANCING DATA EFFICIENCY IN REINFORCEMENT LEARNING: A NOVEL IMAGINATION MECHANISM BASED ON MESH INFORMATION PROPAGATION
Zihang Wang · Maowei Jiang · Pengyu Zeng · ruiqi li · Quangao Liu · Peter Búš
Poster
Thu 16:30 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
Zihao Wang · Shaofei Cai · Zhancun Mu · Haowei Lin · Ceyao Zhang · Xuejie Liu · Qing Li · Anji Liu · Xiaojian (Shawn) Ma · Yitao Liang
Workshop
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack
Leo McKee-Reid · Joe Needham · Maria Martinez · Christoph Sträter · Mikita Balesni
Workshop
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
Alicia Li · Nishanth Kumar · Tomás Lozano-Pérez · Leslie Kaelbling
Poster
Wed 16:30 Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
Davide Maran · Alberto Maria Metelli · Matteo Papini · Marcello Restelli
Poster
Wed 11:00 Autoregressive Policy Optimization for Constrained Allocation Tasks
David Winkel · Niklas Strauß · Maximilian Bernhard · Zongyue Li · Thomas Seidl · Matthias Schubert
Poster
Wed 16:30 JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
Alexander Rutherford · Benjamin Ellis · Matteo Gallici · Jonathan Cook · Andrei Lupu · Garðar Ingvarsson Juto · Timon Willi · Ravi Hammond · Akbir Khan · Christian Schroeder de Witt · Alexandra Souly · Saptarashmi Bandyopadhyay · Mikayel Samvelyan · Minqi Jiang · Robert Lange · Shimon Whiteson · Bruno Lacerda · Nick Hawes · Tim Rocktäschel · Chris Lu · Jakob Foerster