

Poster in Workshop: ML with New Compute Paradigms

MoQ: Mixture-of-format Activation Quantization for Communication-efficient AI Inference System

Haonan Wang ⋅ Zeli Liu ⋅ Chao Fang ⋅ John Walters ⋅ Stephen Crago
2024 Poster

Abstract
