Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

17 Results

<<   <   Page 1 of 2   >   >>
Workshop
Sun 14:40 Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs
Poster
Thu 16:30 Fight Back Against Jailbreaking via Prompt Adversarial Tuning
Yichuan Mo · Yuji Wang · Zeming Wei · Yisen Wang
Poster
Thu 11:00 Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
Cong Wan · Yuhang He · Xiang Song · Yihong Gong
Poster
Thu 11:00 TARP-VP: Towards Evaluation of Transferred Adversarial Robustness and Privacy on Label Mapping Visual Prompting Models
Zhen Chen · Yi Zhang · Fu Wang · Xingyu Zhao · Xiaowei Huang · Wenjie Ruan
Workshop
SkewAct: Red Teaming Large Language Models via Activation-Skewed Adversarial Prompt Optimization
Hanxi Guo · Siyuan Cheng · Guanhong Tao · Guangyu Shen · Zhuo Zhang · Shengwei An · Kaiyuan Zhang · Xiangyu Zhang
Affinity Event
Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques
Rishika Bhagwatkar · Shravan Nayak · Pouya Bashivan · Irina Rish
Poster
Wed 16:30 Few-Shot Adversarial Prompt Learning on Vision-Language Models
Yiwei Zhou · Xiaobo Xia · Zhiwei Lin · Bo Han · Tongliang Liu
Poster
Fri 11:00 Query-Based Adversarial Prompt Generation
Jonathan Hayase · Ema Borevković · Nicholas Carlini · Florian Tramer · Milad Nasr
Poster
Fri 11:00 GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
Yijun Yang · Ruiyuan Gao · Xiao Yang · Jianyuan Zhong · Qiang Xu
Workshop
AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment
Pankayaraj Pathmanathan · Udari Sehwag · Michael-Andrei Panaitescu-Liess · Furong Huang
Workshop
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks
Nathalie Kirch · Severin Field · Stephen Casper
Workshop
Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs
Giulio Zizzo · Giandomenico Cornacchia · Kieran Fraser · Muhammad Zaid Hameed · Ambrish Rawat · Beat Buesser · Mark Purcell · Pin-Yu Chen · Prasanna Sattigeri · Kush Varshney