Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs

Yohan Mathew ⋅ Ollie Matthews ⋅ Robert McCarthy ⋅ Joan Velja ⋅ Christian Schroeder de Witt ⋅ Dylan Cope ⋅ Nandi Schoots

Abstract

Chat is not available.