126 Results
Poster
|
Thu 9:00 |
Globally Convergent Policy Search for Output Estimation Jack Umenberger · Max Simchowitz · Juan Perdomo · Kaiqing Zhang · Russ Tedrake |
|
Poster
|
Wed 9:00 |
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare Shengpu Tang · Maggie Makar · Michael Sjoding · Finale Doshi-Velez · Jenna Wiens |
|
Workshop
|
Efficient Offline Policy Optimization with a Learned Model Zichen Liu · Siyi Li · Wee Sun Lee · Shuicheng Yan · Zhongwen Xu |
||
Workshop
|
Real World Offline Reinforcement Learning with Realistic Data Source Gaoyue Zhou · Liyiming Ke · Siddhartha Srinivasa · Abhinav Gupta · Aravind Rajeswaran · Vikash Kumar |
||
Workshop
|
Homomorphism AutoEncoder --- Learning Group Structured Representations from Observed Transitions Hamza Keurti · Hsiao-Ru Pan · Michel Besserve · Benjamin F. Grewe · Bernhard Schölkopf |
||
Workshop
|
Train Offline, Test Online: A Real Robot Learning Benchmark Gaoyue Zhou · Victoria Dean · Mohan Kumar Srirama · Aravind Rajeswaran · Jyothish Pari · Kyle Hatch · Aryan Jain · Tianhe Yu · Pieter Abbeel · Lerrel Pinto · Chelsea Finn · Abhinav Gupta |
||
Poster
|
Thu 9:00 |
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning Minjong Yoo · SangWoo Cho · Honguk Woo |
|
Poster
|
Tue 9:00 |
A Unified Framework for Alternating Offline Model Training and Policy Learning Shentao Yang · Shujian Zhang · Yihao Feng · Mingyuan Zhou |
|
Workshop
|
Sat 10:15 |
Train Offline, Test Online: A Real Robot Learning Benchmark Gaoyue Zhou · Victoria Dean · Mohan Kumar Srirama · Aravind Rajeswaran · Jyothish Pari · Kyle Hatch · Aryan Jain · Tianhe Yu · Pieter Abbeel · Lerrel Pinto · Chelsea Finn · Abhinav Gupta |
|
Poster
|
Wed 14:00 |
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP Fan Chen · Junyu Zhang · Zaiwen Wen |
|
Workshop
|
Sat 9:20 |
Homomorphism AutoEncoder --- Learning Group Structured Representations from Observed Transitions Hamza Keurti · Hsiao-Ru Pan · Michel Besserve · Benjamin F. Grewe · Bernhard Schölkopf |
|
Poster
|
Tue 14:00 |
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning Marc Rigter · Bruno Lacerda · Nick Hawes |