Skip to yearly menu bar Skip to main content


San Diego Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST Exhibit Hall C,D,E #1112

LLMs Encode Harmfulness and Refusal Separately

Jiachen Zhao · Jing Huang · Zhengxuan Wu · David Bau · Weiyan Shi

Abstract

Log in and register to view live content