Evaluating Language Models' Evaluations of Games

Katie Collins ⋅ Cedegao (Ced) Zhang ⋅ Graham Todd ⋅ Lance Ying ⋅ Mauricio Barba ⋅ Ryan Liu ⋅ Adrian Weller ⋅ Ionatan Kuperwajs ⋅ Catherine Wong ⋅ Josh Tenenbaum ⋅ Tom Griffiths

Abstract
