1 Result
Workshop | Curiosity-driven Red teaming for Large Language Models | Zhang-Wei Hong · Idan Shenfeld · Tsun-Hsuan Johnson Wang · Yung-Sung Chuang · Aldo Pareja · Jim Glass · Akash Srivastava · Pulkit Agrawal