Skip to yearly menu bar Skip to main content


Universal Properties of Activation Sparsity in Modern Large Language Models

Filip Szatkowski · Patryk Będkowski · Alessio Devoto · Jan Dubiński · Pasquale Minervini · Mikołaj Piórczyński · Simone Scardapane · Bartosz Wójcik

Abstract

Chat is not available.