每日Paper进步屋 (Daily Paper Progress Room)
Video Understanding
2023-12-13 update: X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Tran
Video Understanding
2023-12-06 update: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. Authors: Kunchang Li, Yali Wang, Yinan He,
Video Understanding
2023-12-01 update: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. Authors: Kunchang Li, Yali Wang, Yinan He,
Video Understanding
2023-11-28 update: Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding. Authors: Ruyang Liu, Ji
Video Understanding
2023-11-27 update: Vamos: Versatile Action Models for Video Understanding. Authors: Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Aga
Video Understanding
2023-11-19 update: Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understandi
Video Understanding
2023-11-05 update: MOFO: MOtion FOcused Self-Supervision for Video Understanding. Authors: Mona Ahmadian, Frank Guerin, Andrew Gi
Video Understanding
2023-09-28 update: Video Timeline Modeling For News Story Understanding. Authors: Meng Liu, Mingda Zhang, Jialu Liu, Hanjun Dai,
Video Understanding
2023-09-23 update: Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding. Authors