Skip to yearly menu bar Skip to main content


Poster

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Patrick Chao · Edoardo Debenedetti · Alexander Robey · Maksym Andriushchenko · Francesco Croce · Vikash Sehwag · Edgar Dobriban · Nicolas Flammarion · George J. Pappas · Florian Tramer · Hamed Hassani · Eric Wong
2024 Poster

Abstract

Video

Chat is not available.