Skip to yearly menu bar Skip to main content


Poster

Representation Noising: A Defence Mechanism Against Harmful Finetuning

Domenic Rosati · Jan Wehner · Kai Williams · Lukasz Bartoszcze · Robie Gonzales · carsten maple · Subhabrata Majumdar · Hassan Sajjad · Frank Rudzicz
2024 Poster

Abstract

Video

Chat is not available.