Learning from Human Feedback
Paul Christiano
Author Information
Paul Christiano (OpenAI)
More from the Same Authors
- 2022 Poster: Training language models to follow instructions with human feedback
Long Ouyang · Jeffrey Wu · Xu Jiang · Diogo Almeida · Carroll Wainwright · Pamela Mishkin · Chong Zhang · Sandhini Agarwal · Katarina Slama · Alex Ray · John Schulman · Jacob Hilton · Fraser Kelton · Luke Miller · Maddie Simens · Amanda Askell · Peter Welinder · Paul Christiano · Jan Leike · Ryan Lowe