Skip to yearly menu bar Skip to main content


Poster
in
Workshop: ML with New Compute Paradigms

MoQ: Mixture-of-format Activation Quantization for Communication-efficient AI Inference System

Haonan Wang · Zeli Liu · Chao Fang · John Walters · Stephen Crago
2024 Poster
in
Workshop: ML with New Compute Paradigms

Abstract

Chat is not available.