Published: 2022-05-30

Updated 2022-05-30

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

Authors: Yixuan Wei, Han Hu, Zhenda Xie, Zheng Zhang, Yue Cao, Jianmin Bao, Dong Chen, Baining Guo

Masked image modeling (MIM) learns representations with remarkably good fine-tuning performances, overshadowing previous prevalent pre-training approaches such as image classification, instance contrastive learning, and image-text alignment. In this paper, we show that the inferior fine-tuning performance of these pre-training approaches can be significantly improved by a simple post-processing in the form of feature distillation (FD). The feature distillation converts the old representations to new representations that have a few desirable properties just like those representations produced by MIM. These properties, which we aggregately refer to as optimization friendliness, are identified and analyzed by a set of attention- and optimization-related diagnosis tools. With these properties, the new representations show strong fine-tuning performance. Specifically, the contrastive self-supervised learning methods are made as competitive in fine-tuning as the state-of-the-art masked image modeling (MIM) algorithms. The CLIP models’ fine-tuning performance is also significantly improved, with a CLIP ViT-L model reaching 89.0% top-1 accuracy on ImageNet-1K classification. More importantly, our work provides a way for the future research to focus more effort on the generality and scalability of the learnt representations without being pre-occupied with optimization friendliness since it can be enhanced rather easily. The code will be available at https://github.com/SwinTransformer/Feature-Distillation.
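To make the idea of feature distillation more concrete, here is a minimal sketch of what such an objective could look like. The abstract does not spell out the loss, so the details are assumptions for illustration only: the frozen teacher's per-token features are whitened (zero mean, unit variance) and the student is trained to match them under a smooth L1 loss. The function names `whiten`, `smooth_l1`, and `feature_distillation_loss` are hypothetical and are not taken from the authors' code.

```python
import math


def whiten(features, eps=1e-6):
    """Normalize one token's feature vector to zero mean and unit variance.

    Assumption: the teacher target is whitened before distillation.
    """
    mean = sum(features) / len(features)
    var = sum((x - mean) ** 2 for x in features) / len(features)
    return [(x - mean) / math.sqrt(var + eps) for x in features]


def smooth_l1(pred, target, beta=1.0):
    """Mean smooth L1 distance between two equal-length vectors."""
    total = 0.0
    for p, t in zip(pred, target):
        diff = abs(p - t)
        if diff < beta:
            total += 0.5 * diff * diff / beta  # quadratic near zero
        else:
            total += diff - 0.5 * beta  # linear for large errors
    return total / len(pred)


def feature_distillation_loss(student_tokens, teacher_tokens):
    """Average per-token loss between student features and whitened teacher features.

    Both arguments are lists of token feature vectors (lists of floats).
    The teacher features are treated as fixed targets.
    """
    losses = [
        smooth_l1(student, whiten(teacher))
        for student, teacher in zip(student_tokens, teacher_tokens)
    ]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    teacher = [[1.0, 2.0, 3.0], [0.5, -0.5, 4.0]]
    student = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    print(feature_distillation_loss(student, teacher))
```

In a real training setup this loss would be minimized with respect to the student's parameters while the pre-trained teacher (for example a contrastive or CLIP model) stays frozen; the distilled student is then the model that gets fine-tuned.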

木子已

https://ipaper.today/2022/05/30/2022-05-30-wu-jian-du-ban-jian-du-dui-bi-xue-xi/

Unless otherwise stated, all articles on this blog are licensed under CC BY 4.0. Please credit the source, 木子已, when reposting.

Unsupervised / Semi-supervised / Contrastive Learning
