每日Paper进步屋 (Daily Paper Progress House)
Video Understanding
Updated 2024-04-06. LongVLM: Efficient Long Video Understanding via Large Language Models. Authors: Yuetian Weng, Mingfei Han, Hao
Updated 2024-04-03. Instrument-tissue Interaction Detection Framework for Surgical Video Understanding. Authors: Wenjun Lin, Yan
Updated 2024-04-01. A Unified Framework for Human-centric Point Cloud Video Understanding. Authors: Yiteng Xu, Kecheng Ye, Xiao Ha
Updated 2024-03-31. VideoPrism: A Foundational Visual Encoder for Video Understanding. Authors: Long Zhao, Nitesh B. Gundavarapu,
Updated 2024-01-18. CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding. Authors: Y
Updated 2024-01-05. Video Understanding with Large Language Models: A Survey. Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan S
Updated 2024-01-04. Video Understanding with Large Language Models: A Survey. Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan S
Updated 2023-12-24. RTQ: Rethinking Video-language Understanding Based on Image-text Model. Authors: Xiao Wang, Yaoyu Li, Tian Gan
Updated 2023-12-13. X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Tran