

Poster

Aligning Large Language Models with Representation Editing: A Control Perspective

Lingkai Kong · Haorui Wang · Wenhao Mu · Yuanqi Du · Yuchen Zhuang · Yifei Zhou · Yue Song · Rongzhi Zhang · Kai Wang · Chao Zhang
2024 Poster
