Timezone: »
We propose SPRINT, an approach for scalable offline policy pre-training based on natural language instructions. SPRINT pre-trains an agent’s policy to execute a diverse set of semantically meaningful skills that it can leverage to learn new tasks faster. Prior work on offline pre-training required tedious manual definition of pre-training tasks or learned semantically meaningless skills via random goal-reaching. Instead, our approach SPRINT (Scalable Pre-training via Relabeling Language INsTructions) leverages natural language instruction labels on offline agent experience, collected at scale (e.g., via crowd-sourcing), to define a rich set of tasks with minimal human effort. Furthermore, by using natural language to define tasks, SPRINT can use pre-trained large language models to automatically expand the initial task set. By relabeling and aggregating task instructions, even across multiple training trajectories, we can learn a large set of new skills during pre-training. In experiments using a realistic household simulator, we show that agents pre-trained with SPRINT learn new long-horizon household tasks substantially faster than with previous pre-training approaches.
Author Information
Jesse Zhang (University of Southern California)
2nd year PhD student at USC, working on reinforcement learning and robotics.
Karl Pertsch (University of Southern California)
Jiahui Zhang (University of Southern California)
Taewook Nam (KAIST)
Sung Ju Hwang (KAIST, AITRICS)
Xiang Ren (University of Southern California)
Joseph Lim (Korea Advanced Institute of Science & Technology)
More from the Same Authors
-
2020 : Poster #2 »
Xiang Ren -
2021 Spotlight: Refining Language Models with Compositional Explanations »
Huihan Yao · Ying Chen · Qinyuan Ye · Xisen Jin · Xiang Ren -
2021 Spotlight: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning »
Hayeon Lee · Sewoong Lee · Song Chong · Sung Ju Hwang -
2021 Spotlight: Task-Adaptive Neural Network Search with Meta-Contrastive Learning »
Wonyong Jeong · Hayeon Lee · Geon Park · Eunyoung Hyung · Jinheon Baek · Sung Ju Hwang -
2021 : Task-Induced Representation Learning »
Jun Yamada · Karl Pertsch · Anisha Gunjal · Joseph Lim -
2021 : Skill-based Meta-Reinforcement Learning »
Taewook Nam · Shao-Hua Sun · Karl Pertsch · Sung Ju Hwang · Joseph Lim -
2021 : Skill-based Meta-Reinforcement Learning »
Taewook Nam · Shao-Hua Sun · Karl Pertsch · Sung Ju Hwang · Joseph Lim -
2022 Poster: Learning to Generate Inversion-Resistant Model Explanations »
Hoyong Jeong · Suyoung Lee · Sung Ju Hwang · Sooel Son -
2022 : PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales »
Peifeng Wang · Aaron Chan · Filip Ilievski · Muhao Chen · Xiang Ren -
2022 : Adaptive Pre-training of Language Models for Better Logical Reasoning »
Soumya Sanyal · Yichong Xu · Shuohang Wang · Ziyi Yang · Reid Pryzant · Wenhao Yu · Chenguang Zhu · Xiang Ren -
2022 : Information-Theoretic Evaluation of Free-Text Rationales with Conditional $\mathcal{V}$-Information »
Hanjie Chen · Faeze Brahman · Xiang Ren · Yangfeng Ji · Yejin Choi · Swabha Swayamdipta -
2022 : PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales »
Peifeng Wang · Aaron Chan · Filip Ilievski · Muhao Chen · Xiang Ren -
2022 : Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing »
Grace Zhang · Ayush Jain · Injune Hwang · Shao-Hua Sun · Joseph Lim -
2022 : SPRINT: Scalable Semantic Policy Pre-training via Language Instruction Relabeling »
Jesse Zhang · Karl Pertsch · Jiahui Zhang · Taewook Nam · Sung Ju Hwang · Xiang Ren · Joseph Lim -
2022 Poster: NS3: Neuro-symbolic Semantic Code Search »
Shushan Arakelyan · Anna Hakhverdyan · Miltiadis Allamanis · Luis Garcia · Christophe Hauser · Xiang Ren -
2022 Poster: Factorized-FL: Personalized Federated Learning with Parameter Factorization & Similarity Matching »
Wonyong Jeong · Sung Ju Hwang -
2022 Poster: Graph Self-supervised Learning with Accurate Discrepancy Learning »
Dongki Kim · Jinheon Baek · Sung Ju Hwang -
2022 Poster: Set-based Meta-Interpolation for Few-Task Meta-Learning »
Seanie Lee · Bruno Andreis · Kenji Kawaguchi · Juho Lee · Sung Ju Hwang -
2022 Poster: Unsupervised Cross-Task Generalization via Retrieval Augmentation »
Bill Yuchen Lin · Kangmin Tan · Chris Miller · Beiwen Tian · Xiang Ren -
2021 Poster: Edge Representation Learning with Hypergraphs »
Jaehyeong Jo · Jinheon Baek · Seul Lee · Dongki Kim · Minki Kang · Sung Ju Hwang -
2021 Poster: Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation »
Soojung Yang · Doyeong Hwang · Seul Lee · Seongok Ryu · Sung Ju Hwang -
2021 Poster: SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning »
Aaron Chan · Jiashu Xu · Boyuan Long · Soumya Sanyal · Tanishq Gupta · Xiang Ren -
2021 Poster: Learning to Synthesize Programs as Interpretable and Generalizable Policies »
Dweep Trivedi · Jesse Zhang · Shao-Hua Sun · Joseph Lim -
2021 Poster: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning »
Hayeon Lee · Sewoong Lee · Song Chong · Sung Ju Hwang -
2021 Poster: Task-Adaptive Neural Network Search with Meta-Contrastive Learning »
Wonyong Jeong · Hayeon Lee · Geon Park · Eunyoung Hyung · Jinheon Baek · Sung Ju Hwang -
2021 Poster: Gradient-based Editing of Memory Examples for Online Task-free Continual Learning »
Xisen Jin · Arka Sadhu · Junyi Du · Xiang Ren -
2021 Poster: Refining Language Models with Compositional Explanations »
Huihan Yao · Ying Chen · Qinyuan Ye · Xisen Jin · Xiang Ren -
2021 Poster: Mini-Batch Consistent Slot Set Encoder for Scalable Set Encoding »
Bruno Andreis · Jeffrey Willette · Juho Lee · Sung Ju Hwang -
2020 : Contributed Talk: Accelerating Reinforcement Learning with Learned Skill Priors »
Karl Pertsch · Youngwoon Lee · Joseph Lim -
2020 : Contributed Talk 1 - "Accelerating Reinforcement Learning with Learned Skill Priors" (Best Paper Runner-Up) »
Karl Pertsch -
2020 Poster: Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors »
Karl Pertsch · Oleh Rybkin · Frederik Ebert · Shenghao Zhou · Dinesh Jayaraman · Chelsea Finn · Sergey Levine -
2019 : Poster Session »
Ahana Ghosh · Javad Shafiee · Akhilan Boopathy · Alex Tamkin · Theodoros Vasiloudis · Vedant Nanda · Ali Baheri · Paul Fieguth · Andrew Bennett · Guanya Shi · Hao Liu · Arushi Jain · Jacob Tyo · Benjie Wang · Boxiao Chen · Carroll Wainwright · Chandramouli Shama Sastry · Chao Tang · Daniel S. Brown · David Inouye · David Venuto · Dhruv Ramani · Dimitrios Diochnos · Divyam Madaan · Dmitrii Krashenikov · Joel Oren · Doyup Lee · Eleanor Quint · elmira amirloo · Matteo Pirotta · Gavin Hartnett · Geoffroy Dubourg-Felonneau · Gokul Swamy · Pin-Yu Chen · Ilija Bogunovic · Jason Carter · Javier Garcia-Barcos · Jeet Mohapatra · Jesse Zhang · Jian Qian · John Martin · Oliver Richter · Federico Zaiter · Tsui-Wei Weng · Karthik Abinav Sankararaman · Kyriakos Polymenakos · Lan Hoang · mahdieh abbasi · Marco Gallieri · Mathieu Seurin · Matteo Papini · Matteo Turchetta · Matthew Sotoudeh · Mehrdad Hosseinzadeh · Nathan Fulton · Masatoshi Uehara · Niranjani Prasad · Oana-Maria Camburu · Patrik Kolaric · Philipp Renz · Prateek Jaiswal · Reazul Hasan Russel · Riashat Islam · Rishabh Agarwal · Alexander Aldrick · Sachin Vernekar · Sahin Lale · Sai Kiran Narayanaswami · Samuel Daulton · Sanjam Garg · Sebastian East · Shun Zhang · Soheil Dsidbari · Justin Goodwin · Victoria Krakovna · Wenhao Luo · Wesley Chung · Yuanyuan Shi · Yuh-Shyang Wang · Hongwei Jin · Ziping Xu -
2018 Poster: Hierarchical Graph Representation Learning with Differentiable Pooling »
Zhitao Ying · Jiaxuan You · Christopher Morris · Xiang Ren · Will Hamilton · Jure Leskovec -
2018 Spotlight: Hierarchical Graph Representation Learning with Differentiable Pooling »
Zhitao Ying · Jiaxuan You · Christopher Morris · Xiang Ren · Will Hamilton · Jure Leskovec