每日Paper进步屋
Video Understanding
2024-01-04 update: Video Understanding with Large Language Models: A Survey. Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan S
Video Understanding
2023-12-24 update: RTQ: Rethinking Video-language Understanding Based on Image-text Model. Authors: Xiao Wang, Yaoyu Li, Tian Gan
Video Understanding
2023-12-13 update: X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Tran
Video Understanding
2023-12-06 update: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. Authors: Kunchang Li, Yali Wang, Yinan He,
Video Understanding
2023-12-01 update: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. Authors: Kunchang Li, Yali Wang, Yinan He,
Video Understanding
2023-11-28 update: Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding. Authors: Ruyang Liu, Ji
Video Understanding
2023-11-27 update: Vamos: Versatile Action Models for Video Understanding. Authors: Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Aga
Video Understanding
2023-11-19 update: Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understandi
Video Understanding
2023-11-05 update: MOFO: MOtion FOcused Self-Supervision for Video Understanding. Authors: Mona Ahmadian, Frank Guerin, Andrew Gi