NeurIPS 2024

Skip to yearly menu bar Skip to main content

34 Results

Competition	Sun 9:20	Red-Team 2nd Winner - Amazing
Competition	Sun 9:30	Red-Team 3rd Winner - RealStrike
Competition	Sun 9:10	Red-Team 1st Winner - MORSE&ARCLab
Competition	Sun 9:40	Red-Team Special Winner - MEL-PETs
Poster	Fri 16:30	AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Zhaorun Chen · Zhen Xiang · Chaowei Xiao · Dawn Song · Bo Li
Poster	Thu 11:00	ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users Guanlin Li · Kangjie Chen · Shudong Zhang · Jie Zhang · Tianwei Zhang
Workshop		MedAIScout: Automated Retrieval of Known Machine Learning Vulnerabilities in Medical Applications Athish Pranav Dharmalingam · Gargi Mitra
Workshop	Sun 17:00	Invited Talk 7: Max Kaufmann on Red-teaming AI systems in government Max Kaufmann
Workshop		Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding Haneul Yoo · Yongjin Yang · Hwaran Lee
Workshop		Jailbreaking Large Language Models with Symbolic Mathematics Emet Bethany · Mazal Bethany · Juan Nolazco-Flores · Sumit Jha · peyman najafirad
Workshop		Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Noether · Adish Singla · Goran Radanovic
Workshop		What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks Nathalie Kirch · Severin Field · Stephen Casper