Skip to yearly menu bar Skip to main content


Poster

FALCON: Fine-grained Activation Manipulation by Contrastive Orthogonal Unalignment for Large Language Model

Jinwei Hu ⋅ Zhenglin Huang ⋅ Xiangyu Yin ⋅ Wenjie Ruan ⋅ Guangliang Cheng ⋅ Yi Dong ⋅ Xiaowei Huang
2025 Poster

Abstract

Video

Chat is not available.