Skip to yearly menu bar Skip to main content


Poster

Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models

Maribeth Rauh · John Mellor · Jonathan Uesato · Po-Sen Huang · Johannes Welbl · Laura Weidinger · Sumanth Dathathri · Amelia Glaese · Geoffrey Irving · Iason Gabriel · William Isaac · Lisa Anne Hendricks
2022 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.