Skip to yearly menu bar Skip to main content


Spotlight Poster

Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets

Ike Obi ⋅ Rohan Pant ⋅ Srishti Shekhar Agrawal ⋅ Maham Ghazanfar ⋅ Aaron Basiletti
2024 Spotlight Poster

Abstract

Video

Chat is not available.