firstbacksecondback
635 Results
Workshop
|
Sat 9:15 |
Hierarchical Reinforcement Learning with AI Planning Models Junkyu Lee · Michael Katz · Don Joven Agravante · Miao Liu · Geraud Nangue Tasse · Tim Klinger · Shirin Sohrabi Araghi |
|
Poster
|
Wed 15:00 |
On Imitation in Mean-field Games Giorgia Ramponi · Pavel Kolev · Olivier Pietquin · Niao He · Mathieu Lauriere · Matthieu Geist |
|
Poster
|
Thu 8:45 |
Distributional Pareto-Optimal Multi-Objective Reinforcement Learning Xin-Qiang Cai · Pushi Zhang · Li Zhao · Jiang Bian · Masashi Sugiyama · Ashley Llorens |
|
Poster
|
Wed 8:45 |
Model-free Posterior Sampling via Learning Rate Randomization Daniil Tiapkin · Denis Belomestny · Daniele Calandriello · Eric Moulines · Remi Munos · Alexey Naumov · Pierre Perrault · Michal Valko · Pierre Ménard |
|
Workshop
|
is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang |
||
Poster
|
Tue 8:45 |
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with -Greedy Exploration Shuai Zhang · Hongkang Li · Meng Wang · Miao Liu · Pin-Yu Chen · Songtao Lu · Songtao Lu · Sijia Liu · Keerthiram Murugesan · Subhajit Chaudhury |
|
Poster
|
Tue 8:45 |
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control Nate Rahn · Pierluca D'Oro · Harley Wiltzer · Pierre-Luc Bacon · Marc Bellemare |
|
Workshop
|
JaxMARL: Multi-Agent RL Environments in JAX Alexander Rutherford · Benjamin Ellis · Matteo Gallici · Jonathan Cook · Andrei Lupu · Garðar Ingvarsson Juto · Timon Willi · Akbir Khan · Christian Schroeder de Witt · Alexandra Souly · Saptarashmi Bandyopadhyay · Mikayel Samvelyan · Minqi Jiang · Robert Lange · Shimon Whiteson · Bruno Lacerda · Nick Hawes · Tim Rocktäschel · Chris Lu · Jakob Foerster |
||
Poster
|
Thu 15:00 |
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks Maxime Chevalier-Boisvert · Bolun Dai · Mark Towers · Rodrigo Perez-Vicente · Lucas Willems · Salem Lahlou · Suman Pal · Pablo Samuel Castro · J Terry |
|
Competition
|
Fri 7:00 |
The NeurIPS 2023 Neural MMO Challenge: Multi-Task Reinforcement Learning and Curriculum Generation Joseph Suarez · Phillip Isola · David Bloomin · Kyoung Whan Choe · Hao Li · Ryan Sullivan · Nishaanth Kanna · Daniel Scott · Rose Shuman · Herbie Bradley · Louis Castricato · Chenghui Yu · Yuhao Jiang · Qimai Li · Jiaxin Chen · Xiaolong Zhu · Dipam Chakrabroty · Sharada Mohanty · Nikhil Pinnaparaju |
|
Workshop
|
Learning Conditional Policies for Crystal Design Using Offline Reinforcement Learning Prashant Govindarajan · Santiago Miret · Jarrid Rector-Brooks · Mariano Phielipp · Janarthanan Rajendran · Sarath Chandar |