发布日期: 2022-11-19

2022-11-19 更新

CPT-V: A Contrastive Approach to Post-Training Quantization of Vision Transformers

Authors:Natalia Frumkin, Dibakar Gope, Diana Marculescu

When considering post-training quantization, prior work has typically focused on developing a mixed precision scheme or learning the best way to partition a network for quantization. In our work, CPT-V, we look at a general way to improve the accuracy of networks that have already been quantized, simply by perturbing the quantization scales. Borrowing the idea of contrastive loss from self-supervised learning, we find a robust way to jointly minimize a loss function using just 1,000 calibration images. In order to determine the best performing quantization scale, CPT-V contrasts the features of quantized and full precision models in a self-supervised fashion. Unlike traditional reconstruction-based loss functions, the use of a contrastive loss function not only rewards similarity between the quantized and full precision outputs but also helps in distinguishing the quantized output from other outputs within a given batch. In addition, in contrast to prior works, CPT-V proposes a block-wise evolutionary search to minimize a global contrastive loss objective, allowing for accuracy improvement of existing vision transformer (ViT) quantization schemes. For example, CPT-V improves the top-1 accuracy of a fully quantized ViT-Base by 10.30%, 0.78%, and 0.15% for 3-bit, 4-bit, and 8-bit weight quantization levels. Extensive experiments on a variety of other ViT architectures further demonstrate its robustness in extreme quantization scenarios. Our code is available at .
PDF

点此查看论文截图

木子已

https://ipaper.today/2022/11/19/2022-11-19-wu-jian-du-ban-jian-du-dui-bi-xue-xi/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源木子已 !

无监督半监督对比学习

Domain Adaptation

2022-11-21 Domain Adaptation

Domain Adaptation

人脸相关

2022-11-19 人脸相关

人脸相关

无监督/半监督/对比学习

2022-11-19 更新

CPT-V: A Contrastive Approach to Post-Training Quantization of Vision Transformers

打赏用于支持本站流量费