Workshop
|
Disclosing the Biases in Large Language Models via Reward Structured Questions Ezgi Korkmaz |
||
Workshop
|
Revealing the Bias in Large Language Models via Reward Structured Questions Ezgi Korkmaz |
||
Poster
|
Thu 9:00 |
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting Tomasz Korbak · Hady Elsahar · Germán Kruszewski · Marc Dymetman |
|
Poster
|
Wed 14:00 |
Hedging as Reward Augmentation in Probabilistic Graphical Models Debarun Bhattacharjya · Radu Marinescu |
|
Poster
|
Tue 9:00 |
Fine-tuning language models to find agreement among humans with diverse preferences Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield |