Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

3 Results

<<   <   Page 1 of 1   >>   >
Workshop
Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning
Seanie Lee · Minsu Kim · Lynn Cherif · David Dobre · Juho Lee · Sung Ju Hwang · Kenji Kawaguchi · Gauthier Gidel · Yoshua Bengio · Nikolay Malkin · Moksh Jain
Workshop
Plentiful Jailbreaks with String Compositions
Brian Huang
Workshop
Plentiful Jailbreaks with String Compositions
Brian Huang