Skip to yearly menu bar Skip to main content


FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs

Young Jin Kim ⋅ Rawn Henry ⋅ Raffy Fahim ⋅ Hany Awadalla
[ Poster

Abstract

Chat is not available.