Processing math: 100%
Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

673 Results

<<   <   Page 56 of 57   >   >>
Workshop
Sat 9:15 Hierarchical Reinforcement Learning with AI Planning Models
Junkyu Lee · Michael Katz · Don Joven Agravante · Miao Liu · Geraud Nangue Tasse · Tim Klinger · Shirin Sohrabi Araghi
Poster
Thu 8:45 Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective
Zeyu Zhang · Yi Su · Hui Yuan · Yiran Wu · Rishab Balasubramanian · Qingyun Wu · Huazheng Wang · Mengdi Wang
Poster
Wed 8:45 Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Tao Lei · Junwen Bai · Siddhartha Brahma · Joshua Ainslie · Kenton Lee · Yanqi Zhou · Nan Du · Vincent Zhao · Yuexin Wu · Bo Li · Yu Zhang · Ming-Wei Chang
Workshop
Sat 9:36 [Paper-Oral 7] MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning
Dong-Ki Kim · Sungryull Sohn · Lajanugen Logeswaran · Dongsub Shim · Honglak Lee
Poster
Thu 15:00 Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning
Mathias Lechner · lianhao yin · Tim Seyde · Tsun-Hsuan Johnson Wang · Wei Xiao · Ramin Hasani · Joshua Rountree · Daniela Rus
Poster
Wed 15:00 On Imitation in Mean-field Games
Giorgia Ramponi · Pavel Kolev · Olivier Pietquin · Niao He · Mathieu Lauriere · Matthieu Geist
Poster
Thu 8:45 Distributional Pareto-Optimal Multi-Objective Reinforcement Learning
Xin-Qiang Cai · Pushi Zhang · Li Zhao · Jiang Bian · Masashi Sugiyama · Ashley Llorens
Poster
Wed 8:45 Model-free Posterior Sampling via Learning Rate Randomization
Daniil Tiapkin · Denis Belomestny · Daniele Calandriello · Eric Moulines · Remi Munos · Alexey Naumov · Pierre Perrault · Michal Valko · Pierre Ménard
Poster
Tue 8:45 On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ϵ-Greedy Exploration
Shuai Zhang · Hongkang Li · Meng Wang · Miao Liu · Pin-Yu Chen · Songtao Lu · Songtao Lu · Sijia Liu · Keerthiram Murugesan · Subhajit Chaudhury
Workshop
PREMIER-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang
Poster
Tue 8:45 Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn · Pierluca D&#x27;Oro · Harley Wiltzer · Pierre-Luc Bacon · Marc Bellemare
Workshop
JaxMARL: Multi-Agent RL Environments in JAX
Alexander Rutherford · Benjamin Ellis · Matteo Gallici · Jonathan Cook · Andrei Lupu · Garðar Ingvarsson Juto · Timon Willi · Akbir Khan · Christian Schroeder de Witt · Alexandra Souly · Saptarashmi Bandyopadhyay · Mikayel Samvelyan · Minqi Jiang · Robert Lange · Shimon Whiteson · Bruno Lacerda · Nick Hawes · Tim Rocktäschel · Chris Lu · Jakob Foerster