Poster in Workshop: Safe Generative AI

Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs

Yohan Mathew · Ollie Matthews · Robert McCarthy · Joan Velja · Christian Schroeder de Witt · Dylan Cope · Nandi Schoots

Abstract
