10 Results
| Type | Time | Title | Authors |
|---|---|---|---|
| Poster | Wed 9:00 | FedSR: A Simple and Effective Domain Generalization Method for Federated Learning | A. Tuan Nguyen · Philip Torr · Ser Nam Lim |
| Poster | Tue 9:00 | Fine-tuning language models to find agreement among humans with diverse preferences | Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield |
| Poster | Thu 9:00 | Defining and Characterizing Reward Gaming | Joar Skalse · Nikolaus Howe · Dmitrii Krasheninnikov · David Krueger |
| Workshop | | A Multi-Level Framework for the AI Alignment Problem | Betty L Hou · Brian Green |
| Workshop | | Revisiting Value Alignment Through the Lens of Human-Aware AI | Sarath Sreedharan · Subbarao Kambhampati |
| Poster | Wed 9:00 | How to talk so AI will learn: Instructions, descriptions, and autonomy | Theodore Sumers · Robert Hawkins · Mark Ho · Tom Griffiths · Dylan Hadfield-Menell |
| Poster | Tue 14:00 | Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits | Ruibo Liu · Chenyan Jia · Ge Zhang · Ziyu Zhuang · Tony Liu · Soroush Vosoughi |
| Poster | Tue 14:00 | Harmonizing the object recognition strategies of deep neural networks with humans | Thomas FEL · Ivan F Rodriguez Rodriguez · Drew Linsley · Thomas Serre |
| Workshop | | Adversarial Policies Beat Professional-Level Go AIs | Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart Russell |
| Workshop | | Adversarial Policies Beat Professional-Level Go AIs | Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart J Russell |