Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity

Rheeya Uppaal ⋅ Apratim Dey ⋅ Yiting He ⋅ Yiqiao Zhong ⋅ Junjie Hu

Abstract

Chat is not available.