Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Safe Generative AI

Steering Without Side Effects: Improving Post-Deployment Control of Language Models

Asa Cooper Stickland · Aleksandr Lyzhov · Jacob Pfau · Salsabila Mahdi · Samuel Bowman

Abstract

Chat is not available.