每日Paper进步屋
视频理解 视频理解
2024-08-16 更新Towards Event-oriented Long Video UnderstandingAuthors:Yifan Du, Kun Zhou, Yuqi Huo, Yifan Li, Wayne Xin Zh
2024-08-16
视频理解 视频理解
2024-05-14 更新MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingAuthors:Bo He, Hengduo L
2024-05-14
视频理解 视频理解
2024-04-14 更新LongVLM: Efficient Long Video Understanding via Large Language ModelsAuthors:Yuetian Weng, Mingfei Han, Hao
2024-04-14
视频理解 视频理解
2024-04-06 更新LongVLM: Efficient Long Video Understanding via Large Language ModelsAuthors:Yuetian Weng, Mingfei Han, Hao
2024-04-06
视频理解 视频理解
2024-04-03 更新Instrument-tissue Interaction Detection Framework for Surgical Video UnderstandingAuthors:Wenjun Lin, Yan
2024-04-03
视频理解 视频理解
2024-04-01 更新A Unified Framework for Human-centric Point Cloud Video UnderstandingAuthors:Yiteng Xu, Kecheng Ye, Xiao Ha
2024-04-01
视频理解 视频理解
2024-03-31 更新VideoPrism: A Foundational Visual Encoder for Video UnderstandingAuthors:Long Zhao, Nitesh B. Gundavarapu,
2024-03-31
视频理解 视频理解
2024-01-18 更新CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video UnderstandingAuthors:Y
2024-01-18
视频理解 视频理解
2024-01-05 更新Video Understanding with Large Language Models: A SurveyAuthors:Yunlong Tang, Jing Bi, Siting Xu, Luchuan S
2024-01-05
1 / 8