Poster

WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Seungju Han ⋅ Kavel Rao ⋅ Allyson Ettinger ⋅ Liwei Jiang ⋅ Bill Yuchen Lin ⋅ Nathan Lambert ⋅ Yejin Choi ⋅ Nouha Dziri
2024 Poster

Abstract
