firstbacksecondback
790 Results
Poster
|
Wed 8:45 |
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents Zihao Wang · Shaofei Cai · Guanzhou Chen · Anji Liu · Xiaojian (Shawn) Ma · Yitao Liang |
|
Poster
|
Thu 8:45 |
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective Zeyu Zhang · Yi Su · Hui Yuan · Yiran Wu · Rishab Balasubramanian · Qingyun Wu · Huazheng Wang · Mengdi Wang |
|
Poster
|
Thu 8:45 |
Distributional Pareto-Optimal Multi-Objective Reinforcement Learning Xin-Qiang Cai · Pushi Zhang · Li Zhao · Jiang Bian · Masashi Sugiyama · Ashley Llorens |
|
Poster
|
Wed 15:00 |
On Imitation in Mean-field Games Giorgia Ramponi · Pavel Kolev · Olivier Pietquin · Niao He · Mathieu Lauriere · Matthieu Geist |
|
Poster
|
Wed 8:45 |
Model-free Posterior Sampling via Learning Rate Randomization Daniil Tiapkin · Denis Belomestny · Daniele Calandriello · Eric Moulines · Remi Munos · Alexey Naumov · Pierre Perrault · Michal Valko · Pierre Ménard |
|
Workshop
|
is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang |
||
Poster
|
Tue 8:45 |
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with -Greedy Exploration Shuai Zhang · Hongkang Li · Meng Wang · Miao Liu · Pin-Yu Chen · Songtao Lu · Songtao Lu · Sijia Liu · Keerthiram Murugesan · Subhajit Chaudhury |
|
Competition
|
Fri 7:00 |
The NeurIPS 2023 Neural MMO Challenge: Multi-Task Reinforcement Learning and Curriculum Generation Joseph Suarez · Phillip Isola · David Bloomin · Kyoung Whan Choe · Hao Li · Ryan Sullivan · Nishaanth Kanna · Daniel Scott · Rose Shuman · Herbie Bradley · Louis Castricato · Chenghui Yu · Yuhao Jiang · Qimai Li · Jiaxin Chen · Xiaolong Zhu · Dipam Chakrabroty · Sharada Mohanty · Nikhil Pinnaparaju |
|
Poster
|
Thu 15:00 |
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks Maxime Chevalier-Boisvert · Bolun Dai · Mark Towers · Rodrigo Perez-Vicente · Lucas Willems · Salem Lahlou · Suman Pal · Pablo Samuel Castro · J Terry |
|
Workshop
|
Learning Conditional Policies for Crystal Design Using Offline Reinforcement Learning Prashant Govindarajan · Santiago Miret · Jarrid Rector-Brooks · Mariano Phielipp · Janarthanan Rajendran · Sarath Chandar |