

FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs

Young Jin Kim · Rawn Henry · Raffy Fahim · Hany Awadalla
Poster

Abstract
