Timezone: »

ReaSCAN: Compositional Reasoning in Language Grounding
Zhengxuan Wu · Elisa Kreiss · Desmond Ong · Christopher Potts

The ability to compositionally map language to referents, relations, and actions is an essential component of language understanding. The recent gSCAN dataset (Ruis et al. 2020, NeurIPS) is an inspiring attempt to assess the capacity of models to learn this kind of grounding in scenarios involving navigational instructions. However, we show that gSCAN's highly constrained design means that it does not require compositional interpretation and that many details of its instructions and scenarios are not required for task success. To address these limitations, we propose ReaSCAN, a benchmark dataset that builds off gSCAN but requires compositional language interpretation and reasoning about entities and relations. We assess two models on ReaSCAN: a multi-modal baseline and a state-of-the-art graph convolutional neural model. These experiments show that ReaSCAN is substantially harder than gSCAN for both neural architectures. This suggests that ReaSCAN can serve as a valuable benchmark for advancing our understanding of models' compositional generalization and reasoning capabilities.

Author Information

Zhengxuan Wu (Stanford University)
Elisa Kreiss (Stanford University)
Desmond Ong (National University of Singapore)
Christopher Potts (Stanford University)

More from the Same Authors