Vision Transformer

发布日期: 2022-07-30

2022-07-30 更新

Affective Behaviour Analysis Using Pretrained Model with Facial Priori

Authors:Yifan Li, Haomiao Sun, Zhaori Liu, Hu Han

Affective behaviour analysis has aroused researchers’ attention due to its broad applications. However, it is labor exhaustive to obtain accurate annotations for massive face images. Thus, we propose to utilize the prior facial information via Masked Auto-Encoder (MAE) pretrained on unlabeled face images. Furthermore, we combine MAE pretrained Vision Transformer (ViT) and AffectNet pretrained CNN to perform multi-task emotion recognition. We notice that expression and action unit (AU) scores are pure and intact features for valence-arousal (VA) regression. As a result, we utilize AffectNet pretrained CNN to extract expression scores concatenating with expression and AU scores from ViT to obtain the final VA features. Moreover, we also propose a co-training framework with two parallel MAE pretrained ViT for expression recognition tasks. In order to make the two views independent, we random mask most patches during the training process. Then, JS divergence is performed to make the predictions of the two views as consistent as possible. The results on ABAW4 show that our methods are effective.
PDF

点此查看论文截图

木子已

https://ipaper.today/2022/07/30/2022-07-30-vision-transformer/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源木子已 !

Vision Transformer

检测/分割/跟踪

2022-07-30 检测/分割/跟踪

检测分割跟踪

NeRF

2022-07-30 NeRF

NeRF

Vision Transformer

2022-07-30 更新

Affective Behaviour Analysis Using Pretrained Model with Facial Priori

打赏用于支持本站流量费