589 Results
Poster
|
Thu 14:00 |
Dungeons and Data: A Large-Scale NetHack Dataset Eric Hambro · Roberta Raileanu · Danielle Rothermel · Vegard Mella · Tim Rocktäschel · Heinrich Küttler · Naila Murray |
|
Poster
|
Thu 14:00 |
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning Hua Wei · Jingxiao Chen · Xiyang Ji · Hongyang Qin · Minwen Deng · Siqin Li · Liang Wang · Weinan Zhang · Yong Yu · Liu Linc · Lanxiao Huang · Deheng Ye · Qiang Fu · Wei Yang |
|
Poster
|
Tue 14:00 |
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge Linxi Fan · Guanzhi Wang · Yunfan Jiang · Ajay Mandlekar · Yuncong Yang · Haoyi Zhu · Andrew Tang · De-An Huang · Yuke Zhu · Anima Anandkumar |
|
Poster
|
Wed 14:00 |
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control Xuehai Pan · Mickel Liu · Fangwei Zhong · Yaodong Yang · Song-Chun Zhu · Yizhou Wang |
|
Poster
|
Wed 14:00 |
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games Chao Yu · Akash Velu · Eugene Vinitsky · Jiaxuan Gao · Yu Wang · Alexandre Bayen · Yi Wu |
|
Poster
|
Wed 9:00 |
Exploration via Elliptical Episodic Bonuses Mikael Henaff · Roberta Raileanu · Minqi Jiang · Tim Rocktäschel |
|
Poster
|
Wed 9:00 |
Challenging Common Assumptions in Convex Reinforcement Learning Mirco Mutti · Riccardo De Santi · Piersilvio De Bartolomeis · Marcello Restelli |
|
Poster
|
Wed 9:00 |
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds Joshua Albrecht · Abraham Fetterman · Bryden Fogelman · Ellie Kitanidis · Bartosz Wróblewski · Nicole Seo · Michael Rosenthal · Maksis Knutins · Zack Polizzi · James Simon · Kanjun Qiu |
|
Poster
|
Constrained Update Projection Approach to Safe Policy Optimization Long Yang · Jiaming Ji · Juntao Dai · Linrui Zhang · Binbin Zhou · Pengfei Li · Yaodong Yang · Gang Pan |
|
Poster
|
Wed 9:00 |
Model-based Lifelong Reinforcement Learning with Bayesian Exploration Haotian Fu · Shangqun Yu · Michael Littman · George Konidaris |
|
Poster
|
Wed 9:00 |
Distributionally Adaptive Meta Reinforcement Learning Anurag Ajay · Abhishek Gupta · Dibya Ghosh · Sergey Levine · Pulkit Agrawal |
|
Poster
|
Tue 9:00 |
Learning Long-Term Crop Management Strategies with CyclesGym Matteo Turchetta · Luca Corinzia · Scott Sussex · Amanda Burton · Juan Herrera · Ioannis Athanasiadis · Joachim M Buhmann · Andreas Krause |