2022-03-22 更新An Empirical Study of Training End-to-End Vision-and-Language TransformersAuthors:Zi-Yi Dou, Yichong Xu,
2022-03-22