LLM-MQ: Mixed-precision Quantization for Efficient LLM Deployment

Shiyao Li ⋅ Xuefei Ning ⋅ Ke Hong ⋅ Tengxuan Liu ⋅ Luning Wang ⋅ Xiuhong Li ⋅ Kai Zhong ⋅ Guohao Dai ⋅ Huazhong Yang ⋅ Yu Wang

Abstract
