Skip to yearly menu bar Skip to main content


Search All 2022 Events
 

30 Results

<<   <   Page 1 of 3   >   >>
Workshop
Quantization-aware Policy Distillation (QPD)
Thomas Avé · Kevin Mets · Tom De Schepper · Steven Latre
Poster
Tue 14:00 FP8 Quantization: The Power of the Exponent
Andrey Kuzmin · Mart van Baalen · Yuwei Ren · Markus Nagel · Jorn Peters · Tijmen Blankevoort
Poster
Thu 14:00 GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers · Mike Lewis · Younes Belkada · Luke Zettlemoyer
Poster
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation
Chuanxia Zheng · Tung-Long Vuong · Jianfei Cai · Dinh Phung
Poster
Theoretically Better and Numerically Faster Distributed Optimization with Smoothness-Aware Quantization Techniques
Bokun Wang · Mher Safaryan · Peter Richtarik
Poster
Thu 14:00 Redistribution of Weights and Activations for AdderNet Quantization
Ying Nie · Kai Han · Haikang Diao · Chuanjian Liu · Enhua Wu · Yunhe Wang
Poster
Thu 9:00 Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar · Dan Alistarh
Poster
Tue 9:00 Fine-tuning Language Models over Slow Networks using Activation Quantization with Guarantees
Jue WANG · Binhang Yuan · Luka Rimanic · Yongjun He · Tri Dao · Beidi Chen · Christopher Ré · Ce Zhang
Poster
Tue 9:00 Distributed Optimization for Overparameterized Problems: Achieving Optimal Dimension Independent Communication Complexity
Bingqing Song · Ioannis Tsaknakis · Chung-Yiu Yau · Hoi-To Wai · Mingyi Hong
Poster
Tue 14:00 ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences
Ting-An Chen · De-Nian Yang · Ming-syan Chen
Poster
Leveraging Inter-Layer Dependency for Post -Training Quantization
changbao wang · DanDan Zheng · Yuanliu Liu · Liang Li
Poster
Tue 9:00 XTC: Extreme Compression for Pre-trained Transformers Made Simple and Efficient
Xiaoxia Wu · Zhewei Yao · Minjia Zhang · Conglong Li · Yuxiong He