firstbacksecondback
30 Results
Workshop
|
Quantization-aware Policy Distillation (QPD) Thomas Avé · Kevin Mets · Tom De Schepper · Steven Latre |
||
Poster
|
Tue 14:00 |
FP8 Quantization: The Power of the Exponent Andrey Kuzmin · Mart van Baalen · Yuwei Ren · Markus Nagel · Jorn Peters · Tijmen Blankevoort |
|
Poster
|
Thu 14:00 |
GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale Tim Dettmers · Mike Lewis · Younes Belkada · Luke Zettlemoyer |
|
Poster
|
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation Chuanxia Zheng · Tung-Long Vuong · Jianfei Cai · Dinh Phung |
||
Poster
|
Theoretically Better and Numerically Faster Distributed Optimization with Smoothness-Aware Quantization Techniques Bokun Wang · Mher Safaryan · Peter Richtarik |
||
Poster
|
Thu 14:00 |
Redistribution of Weights and Activations for AdderNet Quantization Ying Nie · Kai Han · Haikang Diao · Chuanjian Liu · Enhua Wu · Yunhe Wang |
|
Poster
|
Thu 9:00 |
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning Elias Frantar · Dan Alistarh |
|
Poster
|
Tue 9:00 |
Fine-tuning Language Models over Slow Networks using Activation Quantization with Guarantees Jue WANG · Binhang Yuan · Luka Rimanic · Yongjun He · Tri Dao · Beidi Chen · Christopher Ré · Ce Zhang |
|
Poster
|
Tue 9:00 |
Distributed Optimization for Overparameterized Problems: Achieving Optimal Dimension Independent Communication Complexity Bingqing Song · Ioannis Tsaknakis · Chung-Yiu Yau · Hoi-To Wai · Mingyi Hong |
|
Poster
|
Tue 14:00 |
ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences Ting-An Chen · De-Nian Yang · Ming-syan Chen |
|
Poster
|
Leveraging Inter-Layer Dependency for Post -Training Quantization changbao wang · DanDan Zheng · Yuanliu Liu · Liang Li |
||
Poster
|
Tue 9:00 |
XTC: Extreme Compression for Pre-trained Transformers Made Simple and Efficient Xiaoxia Wu · Zhewei Yao · Minjia Zhang · Conglong Li · Yuxiong He |