NeurIPS 2022

Poster

Wed 9:00

FedSR: A Simple and Effective Domain Generalization Method for Federated Learning
A. Tuan Nguyen · Philip Torr · Ser Nam Lim

Poster

Tue 9:00

Fine-tuning language models to find agreement among humans with diverse preferences
Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield

Poster

Thu 9:00

Defining and Characterizing Reward Gaming
Joar Skalse · Nikolaus Howe · Dmitrii Krasheninnikov · David Krueger

Workshop

A Multi-Level Framework for the AI Alignment Problem
Betty L Hou · Brian Green

Workshop

Revisiting Value Alignment Through the Lens of Human-Aware AI
Sarath Sreedharan · Subbarao Kambhampati

Poster

Wed 9:00

How to talk so AI will learn: Instructions, descriptions, and autonomy
Theodore Sumers · Robert Hawkins · Mark Ho · Tom Griffiths · Dylan Hadfield-Menell

Poster

Tue 14:00

Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu · Chenyan Jia · Ge Zhang · Ziyu Zhuang · Tony Liu · Soroush Vosoughi

Poster

Tue 14:00

Harmonizing the object recognition strategies of deep neural networks with humans
Thomas FEL · Ivan F Rodriguez Rodriguez · Drew Linsley · Thomas Serre

Workshop

Adversarial Policies Beat Professional-Level Go AIs
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart Russell

Workshop

Adversarial Policies Beat Professional-Level Go AIs
Tony Wang · Adam Gleave · Nora Belrose · Tom Tseng · Michael Dennis · Yawen Duan · Viktor Pogrebniak · Joseph Miller · Sergey Levine · Stuart J Russell

Main Navigation

10 Results