Skip to yearly menu bar Skip to main content


LLM-MQ: Mixed-precision Quantization for Efficient LLM Deployment

Shiyao Li · Xuefei Ning · Ke Hong · Tengxuan Liu · Luning Wang · Xiuhong Li · Kai Zhong · Guohao Dai · Huazhong Yang · Yu Wang

Abstract

Chat is not available.