Skip to yearly menu bar Skip to main content


Poster

Large language models can learn and generalize steganographic chain-of-thought under process supervision

ROBERT MC CARTHY ⋅ Joey SKAF ⋅ Luis Ibanez-Lissen ⋅ Vasil Georgiev ⋅ Connor Watts ⋅ Hannes Whittingham ⋅ Lorena Gonzalez-Manzano ⋅ Cameron Tice ⋅ Edward Young ⋅ Puria Radmard ⋅ David Lindner
2025 Poster

Abstract

Video

Chat is not available.