Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Xiaodong Yu ⋅ Hao Cheng ⋅ Xiaodong Liu ⋅ Dan Roth ⋅ Jianfeng Gao

Abstract
