LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation

Huizhen Shu · Xuying Li · Zhuo Li

Abstract
