每日Paper进步屋
Vision Transformer Vision Transformer
2022-10-10 更新Pix2Struct: Screenshot Parsing as Pretraining for Visual Language UnderstandingAuthors:Kenton Lee, Mandar
Vision Transformer Vision Transformer
2022-10-09 更新MaPLe: Multi-modal Prompt LearningAuthors:Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Kh
Vision Transformer Vision Transformer
2022-10-06 更新SemMAE: Semantic-Guided Masking for Learning Masked AutoencodersAuthors:Gang Li, Heliang Zheng, Daqing Liu,
Vision Transformer Vision Transformer
2022-10-05 更新Architecture-Agnostic Masked Image Modeling — From ViT back to CNNAuthors:Siyuan Li, Di Wu, Fang Wu, Zelin
Vision Transformer Vision Transformer
2022-10-04 更新A Strong Transfer Baseline for RGB-D Fusion in Vision TransformersAuthors:Georgios Tziafas, Hamidreza Kasae
Vision Transformer Vision Transformer
2022-10-03 更新3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segme
Vision Transformer Vision Transformer
2022-09-30 更新Adaptive Sparse ViT: Towards Learnable Adaptive Token Pruning by Fully Exploiting Self-AttentionAuthors:X
Vision Transformer Vision Transformer
2022-09-29 更新MTU-Net: Multi-level TransUNet for Space-based Infrared Tiny Ship DetectionAuthors:Tianhao Wu, Boyang Li,
Vision Transformer Vision Transformer
2022-09-27 更新Multimodal Learning with Channel-Mixing and Masked Autoencoder on Facial Action Unit DetectionAuthors:Xia
16 / 28