100 Results
Workshop | TOD-Flow: Modeling the Structure of Task-Oriented Dialogues | Sungryull Sohn · Yiwei Lyu · Anthony Liu · Lajanugen Logeswaran · Dong-Ki Kim · Dongsub Shim · Honglak Lee
Workshop | PREMIER-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Ruijie Zheng · Yongyuan Liang · Xiyao Wang · Shuang Ma · Hal Daumé III · Huazhe Xu · John Langford · Praveen Palanisamy · Kalyan Basu · Furong Huang
Poster | Tue 8:45 | Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control | Nate Rahn · Pierluca D'Oro · Harley Wiltzer · Pierre-Luc Bacon · Marc Bellemare
Workshop | Learning Conditional Policies for Crystal Design Using Offline Reinforcement Learning | Prashant Govindarajan · Santiago Miret · Jarrid Rector-Brooks · Mariano Phielipp · Janarthanan Rajendran · Sarath Chandar