Timezone: »
Recent large pre-trained language models such as GPT-3 have achieved remarkable progress on mathematical reasoning tasks written in text form, such as math word problems (MWP). However, it is unknown if models can handle more complex problems that involve heterogeneous information, such as tabular data. To fill the gap, we present Tabular Math Word Problems (TabMWP), a new dataset containing 38,431 open-domain problems that require mathematical reasoning on both textual and tabular data, where each question is aligned with a tabular context. We evaluate different pre-trained models on TabMWP, including the GPT-3 model in a few-shot setting. As earlier studies suggest, since few-shot GPT-3 relies on the selection of in-context examples, its performance is unstable and can degrade to near chance. This issue is more severe when handling complex problems like TabMWP. To mitigate this, we further propose a novel approach, PromptPG, which utilizes policy gradient to learn to select good in-context examples from a small amount of training data. Experimental results show that our method outperforms the best baseline by 5.31% in accuracy and reduces the prediction variance significantly compared to random selection.
Author Information
Pan Lu (UCLA; AI2)
Liang Qiu (University of California, Los Angeles)
My name is Liang Qiu. I am doing research in the Center for Vision, Cognition, Learning, and Autonomy (VCLA) at UCLA. I design algorithms and write code to build multi-modal socially intelligent interactive systems. My research interests include natural language processing, dialogue management, computational sociology, and medical AI.
Kai-Wei Chang (UCLA)
Ying Nian Wu (University of California, Los Angeles)
Song-Chun Zhu (UCLA)
Tanmay Rajpurohit (Georgia Institute of Technology)
Peter Clark (Allen Institute for AI)
Ashwin Kalyan (AI2)
More from the Same Authors
-
2020 : Paper 2: Energy-Based Continuous Inverse Optimal Control »
Yifei Xu · Jianwen Xie · Chris Baker · Yibiao Zhao · Ying Nian Wu -
2021 : IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning »
Pan Lu · Liang Qiu · Jiaqi Chen · Tanglin Xia · Yizhou Zhao · Wei Zhang · Zhou Yu · Xiaodan Liang · Song-Chun Zhu -
2021 : Theorem-Aware Geometry Problem Solving with Symbolic Reasoning and Theorem Prediction »
Pan Lu · Ran Gong · Shibiao Jiang · Liang Qiu · Siyuan Huang · Xiaodan Liang · Song-Chun Zhu · Ran Gong -
2021 : Towards Diagram Understanding and Cognitive Reasoning in Icon Question Answering »
Pan Lu · Liang Qiu · Jiaqi Chen · Tanglin Xia · Yizhou Zhao · Wei Zhang · Zhou Yu · Xiaodan Liang · Song-Chun Zhu -
2021 : Deep Generative model with Hierarchical Latent Factors for Timeseries Anomaly Detection »
Cristian Challu · Peihong Jiang · Ying Nian Wu · Laurent Callot -
2021 : Unsupervised Meta-Learning via Latent Space Energy-based Model of Symbol Vector Coupling »
Bo Pang · Deqian Kong · Ying Nian Wu -
2021 : Deep Generative model with Hierarchical Latent Factors for Timeseries Anomaly Detection »
Cristian Challu · Peihong Jiang · Ying Nian Wu · Laurent Callot -
2022 Poster: Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning »
Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang -
2022 : Estimating Numbers without Regression »
Avijit Thawani · Jay Pujara · Ashwin Kalyan -
2022 : LILA: A Unified Benchmark for Mathematical Reasoning »
Swaroop Mishra · Matthew Finlayson · Pan Lu · Leonard Tang · Sean Welleck · Chitta Baral · Tanmay Rajpurohit · Oyvind Tafjord · Ashish Sabharwal · Peter Clark · Ashwin Kalyan -
2022 : Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells »
Dehong Xu · Ruiqi Gao · Wenhao Zhang · Xue-Xin Wei · Ying Nian Wu -
2022 : Neural-Symbolic Recursive Machine for Systematic Generalization »
Qing Li · Yixin Zhu · Yitao Liang · Ying Nian Wu · Song-Chun Zhu · Siyuan Huang -
2023 : MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts »
Pan Lu · Hritik Bansal · Tanglin Xia · Jiacheng Liu · Chunyuan Li · Hannaneh Hajishirzi · Hao Cheng · Kai-Wei Chang · Michel Galley · Jianfeng Gao -
2023 : Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models »
Pan Lu · Baolin Peng · Hao Cheng · Michel Galley · Kai-Wei Chang · Ying Nian Wu · Song-Chun Zhu · Jianfeng Gao -
2023 : SCIBENCH: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models »
Xiaoxuan Wang · Ziniu Hu · Pan Lu · Yanqiao Zhu · Jieyu Zhang · Satyen Subramaniam · Arjun Loomba · Shichang Zhang · Yizhou Sun · Wei Wang -
2023 : Automated distillation of genomic equations governing single cell gene expression »
Edouardo Honig · Edouardo Honig · Frederique Ruf Zamojski · Stuart Sealfon · Stuart Sealfon · Ying Nian Wu · Zijun Frank Zhang · Zijun Frank Zhang -
2023 : Molecule Design by Latent Prompt Transformer »
Deqian Kong · Yuhao Huang · Jianwen Xie · Ying Nian Wu -
2023 : CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization »
Bodhisattwa Prasad Majumder · Bhavana Dalvi Mishra · Peter A Jansen · Oyvind Tafjord · Niket Tandon · Li Zhang · Chris Callison-Burch · Peter Clark -
2023 : Anthropomorphization of AI: Opportunities and Risks »
Ameet Deshpande · Tanmay Rajpurohit · Karthik Narasimhan · Ashwin Kalyan -
2023 Workshop: MATH-AI: The 3rd Workshop on Mathematical Reasoning and AI »
Zhenwen Liang · Albert Q. Jiang · Katie Collins · Pan Lu · Kaiyu Yang · Sean Welleck · James McClelland -
2023 Poster: Learning Energy-Based Prior Model with Diffusion-Amortized MCMC »
Peiyu Yu · Yaxuan Zhu · Sirui Xie · Xiaojian (Shawn) Ma · Ruiqi Gao · Song-Chun Zhu · Ying Nian Wu -
2023 Poster: Learning non-Markovian Decision-Making from State-only Sequences »
Aoyang Qin · Feng Gao · Qing Li · Song-Chun Zhu · Sirui Xie -
2023 Poster: Self-Refine: Iterative Refinement with Self-Feedback »
Aman Madaan · Niket Tandon · Prakhar Gupta · Skyler Hallinan · Luyu Gao · Sarah Wiegreffe · Uri Alon · Nouha Dziri · Shrimai Prabhumoye · Yiming Yang · Shashank Gupta · Bodhisattwa Prasad Majumder · Katherine Hermann · Sean Welleck · Amir Yazdanbakhsh · Peter Clark -
2023 Poster: A Recurrent Neural Circuit Mechanism of Temporal-scaling Equivariant Representation »
Junfeng Zuo · Xiao Liu · Ying Nian Wu · Si Wu · Wenhao Zhang -
2023 Poster: Evaluating and Inducing Personality in Pre-trained Language Models »
Guangyuan Jiang · Manjie Xu · Song-Chun Zhu · Wenjuan Han · Chi Zhang · Yixin Zhu -
2023 Poster: Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models »
Pan Lu · Baolin Peng · Hao Cheng · Michel Galley · Kai-Wei Chang · Ying Nian Wu · Song-Chun Zhu · Jianfeng Gao -
2023 Poster: Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning »
Hengli Li · Song-Chun Zhu · Zilong Zheng -
2022 Spotlight: Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning »
Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang -
2022 : Towards Systematic Reasoning with Language Models »
Peter Clark -
2022 Workshop: MATH-AI: Toward Human-Level Mathematical Reasoning »
Pan Lu · Swaroop Mishra · Sean Welleck · Yuhuai Wu · Hannaneh Hajishirzi · Percy Liang -
2022 Poster: EgoTaskQA: Understanding Human Tasks in Egocentric Videos »
Baoxiong Jia · Ting Lei · Song-Chun Zhu · Siyuan Huang -
2022 Poster: Emergent Graphical Conventions in a Visual Communication Game »
Shuwen Qiu · Sirui Xie · Lifeng Fan · Tao Gao · Jungseock Joo · Song-Chun Zhu · Yixin Zhu -
2022 Poster: Translation-equivariant Representation in Recurrent Networks with a Continuous Manifold of Attractors »
Wenhao Zhang · Ying Nian Wu · Si Wu -
2022 Poster: MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control »
Xuehai Pan · Mickel Liu · Fangwei Zhong · Yaodong Yang · Song-Chun Zhu · Yizhou Wang -
2022 Poster: Learning Probabilistic Models from Generator Latent Spaces with Hat EBM »
Mitch Hill · Erik Nijkamp · Jonathan Mitchell · Bo Pang · Song-Chun Zhu -
2022 Poster: Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering »
Pan Lu · Swaroop Mishra · Tanglin Xia · Liang Qiu · Kai-Wei Chang · Song-Chun Zhu · Oyvind Tafjord · Peter Clark · Ashwin Kalyan -
2021 Workshop: Math AI for Education (MATHAI4ED): Bridging the Gap Between Research and Smart Education »
Pan Lu · Yuhuai Wu · Sean Welleck · Xiaodan Liang · Eric Xing · James McClelland -
2021 Poster: On Path Integration of Grid Cells: Group Representation and Isotropic Scaling »
Ruiqi Gao · Jianwen Xie · Xue-Xin Wei · Song-Chun Zhu · Ying Nian Wu -
2021 Poster: Iterative Teacher-Aware Learning »
Luyao Yuan · Dongruo Zhou · Junhong Shen · Jingdong Gao · Jeffrey L Chen · Quanquan Gu · Ying Nian Wu · Song-Chun Zhu -
2021 Poster: Unsupervised Foreground Extraction via Deep Region Competition »
Peiyu Yu · Sirui Xie · Xiaojian (Shawn) Ma · Yixin Zhu · Ying Nian Wu · Song-Chun Zhu -
2021 Poster: Exploring Forensic Dental Identification with Deep Learning »
Yuan Liang · Weikun Han · Liang Qiu · Chen Wu · Yiting Shao · Kun Wang · Lei He -
2020 Poster: Learning Latent Space Energy-Based Prior Model »
Bo Pang · Tian Han · Erik Nijkamp · Song-Chun Zhu · Ying Nian Wu -
2020 Poster: Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge »
Alon Talmor · Oyvind Tafjord · Peter Clark · Yoav Goldberg · Jonathan Berant -
2020 Spotlight: Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge »
Alon Talmor · Oyvind Tafjord · Peter Clark · Yoav Goldberg · Jonathan Berant -
2019 : Extended Poster Session »
Travis LaCroix · Marie Ossenkopf · Mina Lee · Nicole Fitzgerald · Daniela Mihai · Jonathon Hare · Ali Zaidi · Alexander Cowen-Rivers · Alana Marzoev · Eugene Kharitonov · Luyao Yuan · Tomasz Korbak · Paul Pu Liang · Yi Ren · Roberto Dessì · Peter Potash · Shangmin Guo · Tatsunori Hashimoto · Percy Liang · Julian Zubek · Zipeng Fu · Song-Chun Zhu · Adam Lerer -
2019 Poster: Learning Perceptual Inference by Contrasting »
Chi Zhang · Baoxiong Jia · Feng Gao · Yixin Zhu · HongJing Lu · Song-Chun Zhu -
2019 Spotlight: Learning Perceptual Inference by Contrasting »
Chi Zhang · Baoxiong Jia · Feng Gao · Yixin Zhu · HongJing Lu · Song-Chun Zhu -
2019 Poster: PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points »
Siyuan Huang · Yixin Chen · Tao Yuan · Siyuan Qi · Yixin Zhu · Song-Chun Zhu -
2019 Poster: Learning Non-Convergent Non-Persistent Short-Run MCMC Toward Energy-Based Model »
Erik Nijkamp · Mitch Hill · Song-Chun Zhu · Ying Nian Wu -
2018 Poster: Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation »
Siyuan Huang · Siyuan Qi · Yinxue Xiao · Yixin Zhu · Ying Nian Wu · Song-Chun Zhu -
2014 Workshop: 4th Workshop on Automated Knowledge Base Construction (AKBC) »
Sameer Singh · Fabian M Suchanek · Sebastian Riedel · Partha Pratim Talukdar · Kevin Murphy · Christopher Ré · William Cohen · Tom Mitchell · Andrew McCallum · Jason Weston · Ramanathan Guha · Boyan Onyshkevych · Hoifung Poon · Oren Etzioni · Ari Kobren · Arvind Neelakantan · Peter Clark -
2013 Poster: Unsupervised Structure Learning of Stochastic And-Or Grammars »
Kewei Tu · Maria Pavlovskaia · Song-Chun Zhu -
2011 Poster: Image Parsing with Stochastic Scene Grammar »
Yibiao Zhao · Song-Chun Zhu