Skip to yearly menu bar Skip to main content


Poster
in
Datasets and Benchmarks: Dataset and Benchmark Poster Session 1

Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models

Boxin Wang ⋅ Chejian Xu ⋅ Shuohang Wang ⋅ Zhe Gan ⋅ Yu Cheng ⋅ Jianfeng Gao ⋅ Ahmed Awadallah ⋅ Bo Li

Abstract

Video

Chat is not available.