Competition
|
Sun 9:20
|
Red-Team 2nd Winner - Amazing
|
|
Competition
|
Sun 9:30
|
Red-Team 3rd Winner - RealStrike
|
|
Competition
|
Sun 9:10
|
Red-Team 1st Winner - MORSE&ARCLab
|
|
Competition
|
Sun 9:40
|
Red-Team Special Winner - MEL-PETs
|
|
Poster
|
Fri 16:30
|
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
Zhaorun Chen · Zhen Xiang · Chaowei Xiao · Dawn Song · Bo Li
|
|
Poster
|
Thu 11:00
|
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users
Guanlin Li · Kangjie Chen · Shudong Zhang · Jie Zhang · Tianwei Zhang
|
|
Workshop
|
|
MedAIScout: Automated Retrieval of Known Machine Learning Vulnerabilities in Medical Applications
Athish Pranav Dharmalingam · Gargi Mitra
|
|
Workshop
|
Sun 17:00
|
Invited Talk 7: Max Kaufmann on Red-teaming AI systems in government
Max Kaufmann
|
|
Workshop
|
|
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding
Haneul Yoo · Yongjin Yang · Hwaran Lee
|
|
Workshop
|
|
Jailbreaking Large Language Models with Symbolic Mathematics
Emet Bethany · Mazal Bethany · Juan Nolazco-Flores · Sumit Jha · peyman najafirad
|
|
Workshop
|
|
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints
Jonathan Noether · Adish Singla · Goran Radanovic
|
|
Workshop
|
|
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks
Nathalie Kirch · Severin Field · Stephen Casper
|
|