Timezone: »

 
Learning from Human Feedback
Paul Christiano

Sat Dec 09 01:15 PM -- 01:45 PM (PST) @

Author Information

Paul Christiano (OpenAI)

More from the Same Authors

  • 2022 Poster: Training language models to follow instructions with human feedback »
    Long Ouyang · Jeffrey Wu · Xu Jiang · Diogo Almeida · Carroll Wainwright · Pamela Mishkin · Chong Zhang · Sandhini Agarwal · Katarina Slama · Alex Ray · John Schulman · Jacob Hilton · Fraser Kelton · Luke Miller · Maddie Simens · Amanda Askell · Peter Welinder · Paul Christiano · Jan Leike · Ryan Lowe