Search All 2022 Events
 

10 Results

Poster
Wed 9:00 FedSR: A Simple and Effective Domain Generalization Method for Federated Learning
A. Tuan Nguyen · Philip Torr · Ser Nam Lim
Poster
Tue 9:00 Fine-tuning language models to find agreement among humans with diverse preferences
Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield
Poster
Thu 9:00 Defining and Characterizing Reward Gaming
Joar Skalse · Nikolaus Howe · Dmitrii Krasheninnikov · David Krueger
Workshop
A Multi-Level Framework for the AI Alignment Problem
Betty L Hou · Brian Green
Workshop
Revisiting Value Alignment Through the Lens of Human-Aware AI
Sarath Sreedharan · Subbarao Kambhampati
Poster
Wed 9:00 How to talk so AI will learn: Instructions, descriptions, and autonomy
Theodore Sumers · Robert Hawkins · Mark Ho · Tom Griffiths · Dylan Hadfield-Menell
Poster
Tue 14:00 Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu · Chenyan Jia · Ge Zhang · Ziyu Zhuang · Tony Liu · Soroush Vosoughi
Poster
Tue 14:00 Harmonizing the object recognition strategies of deep neural networks with humans
Thomas Fel · Ivan F Rodriguez Rodriguez · Drew Linsley · Thomas Serre
Workshop
Adversarial Policies Beat Professional-Level Go AIs
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart Russell
Workshop
Adversarial Policies Beat Professional-Level Go AIs
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart J Russell