Oral in Workshop: Safe Generative AI

Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs

Giulio Zizzo ⋅ Giandomenico Cornacchia ⋅ Kieran Fraser ⋅ Muhammad Zaid Hameed ⋅ Ambrish Rawat ⋅ Beat Buesser ⋅ Mark Purcell ⋅ Pin-Yu Chen ⋅ Prasanna Sattigeri ⋅ Kush Varshney

Abstract
