RelP: Faithful and Efficient Circuit Discovery in Language Models via Relevance Patching

Farnoush Rezaei Jafari · Oliver Eberle · Ashkan Khakzar · Neel Nanda

Abstract
