Timezone: »
Karthik Narasimhan: Semantic Supervision for few-shot generalization and personalization
Karthik Narasimhan
Sat Dec 03 07:05 AM -- 07:35 AM (PST) @
A desirable feature of interactive NLP systems is the ability to receive feedback from humans and personalize to new users. Existing paradigms encounter challenges in acquiring new concepts due to the use of discrete labels and scalar rewards. As one solution to alleviate this problem, I will present our work on Semantic Supervision (SemSUP), which trains models to predict over multiple natural language descriptions of classes (or even structured ones like JSON). SemSUP can seamlessly replace any standard supervised learning setup without sacrificing any in-distribution accuracy, while providing generalization to unseen concepts and scalability to large label spaces.
Author Information
Karthik Narasimhan (Princeton University)
More from the Same Authors
-
2022 : REACT: Synergizing Reasoning and Acting in Language Models »
Shunyu Yao · Jeffrey Zhao · Dian Yu · Izhak Shafran · Karthik Narasimhan · Yuan Cao -
2022 : Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment »
John Yang · Howard Chen · Karthik Narasimhan -
2023 Poster: Reflexion: language agents with verbal reinforcement learning »
Noah Shinn · Federico Cassano · Ashwin Gopinath · Karthik Narasimhan · Shunyu Yao -
2023 Poster: Tree of Thoughts: Deliberate Problem Solving with Large Language Models »
Shunyu Yao · Dian Yu · Jeffrey Zhao · Izhak Shafran · Tom Griffiths · Yuan Cao · Karthik Narasimhan -
2023 Poster: InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback »
John Yang · Akshara Prabhakar · Karthik Narasimhan · Shunyu Yao -
2023 Oral: Tree of Thoughts: Deliberate Problem Solving with Large Language Models »
Shunyu Yao · Dian Yu · Jeffrey Zhao · Izhak Shafran · Tom Griffiths · Yuan Cao · Karthik Narasimhan -
2022 Poster: Using natural language and program abstractions to instill human inductive biases in machines »
Sreejan Kumar · Carlos G. Correa · Ishita Dasgupta · Raja Marjieh · Michael Y Hu · Robert Hawkins · Jonathan D Cohen · nathaniel daw · Karthik Narasimhan · Tom Griffiths -
2022 Poster: WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents »
Shunyu Yao · Howard Chen · John Yang · Karthik Narasimhan -
2022 Poster: Learning Physics Constrained Dynamics Using Autoencoders »
Tsung-Yen Yang · Justinian Rosca · Karthik Narasimhan · Peter J. Ramadge -
2022 Poster: DataMUX: Data Multiplexing for Neural Networks »
Vishvak Murahari · Carlos Jimenez · Runzhe Yang · Karthik Narasimhan -
2016 Poster: Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation »
Tejas Kulkarni · Karthik Narasimhan · Ardavan Saeedi · Josh Tenenbaum