firstbacksecondback
17 Results
Workshop
|
Sun 14:40 |
Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs |
|
Poster
|
Thu 16:30 |
Fight Back Against Jailbreaking via Prompt Adversarial Tuning Yichuan Mo · Yuji Wang · Zeming Wei · Yisen Wang |
|
Poster
|
Thu 11:00 |
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models Cong Wan · Yuhang He · Xiang Song · Yihong Gong |
|
Poster
|
Thu 11:00 |
TARP-VP: Towards Evaluation of Transferred Adversarial Robustness and Privacy on Label Mapping Visual Prompting Models Zhen Chen · Yi Zhang · Fu Wang · Xingyu Zhao · Xiaowei Huang · Wenjie Ruan |
|
Workshop
|
SkewAct: Red Teaming Large Language Models via Activation-Skewed Adversarial Prompt Optimization Hanxi Guo · Siyuan Cheng · Guanhong Tao · Guangyu Shen · Zhuo Zhang · Shengwei An · Kaiyuan Zhang · Xiangyu Zhang |
||
Affinity Event
|
Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques Rishika Bhagwatkar · Shravan Nayak · Pouya Bashivan · Irina Rish |
||
Poster
|
Wed 16:30 |
Few-Shot Adversarial Prompt Learning on Vision-Language Models Yiwei Zhou · Xiaobo Xia · Zhiwei Lin · Bo Han · Tongliang Liu |
|
Poster
|
Fri 11:00 |
Query-Based Adversarial Prompt Generation Jonathan Hayase · Ema Borevković · Nicholas Carlini · Florian Tramer · Milad Nasr |
|
Poster
|
Fri 11:00 |
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts Yijun Yang · Ruiyuan Gao · Xiao Yang · Jianyuan Zhong · Qiang Xu |
|
Workshop
|
AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment Pankayaraj Pathmanathan · Udari Sehwag · Michael-Andrei Panaitescu-Liess · Furong Huang |
||
Workshop
|
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks Nathalie Kirch · Severin Field · Stephen Casper |
||
Workshop
|
Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs Giulio Zizzo · Giandomenico Cornacchia · Kieran Fraser · Muhammad Zaid Hameed · Ambrish Rawat · Beat Buesser · Mark Purcell · Pin-Yu Chen · Prasanna Sattigeri · Kush Varshney |