Post Training Quantization of Large Language Models with Microscaling Formats

Sayeh Sharify · Utkarsh Saxena · Zifei Xu · Wanzin Yazar · Ilya Soloveychik · Xin Wang
