SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Song Han
