Skip to yearly menu bar Skip to main content


Poster

On scalable oversight with weak LLMs judging strong LLMs

Zachary Kenton · Noah Siegel · Janos Kramar · Jonah Brown-Cohen · Samuel Albanie · Jannis Bulian · Rishabh Agarwal · David Lindner · Yunhao Tang · Noah Goodman · Rohin Shah
2024 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.