Category: Video Understanding

进步屋

Daily updates on the latest papers in computer vision


                            
                        
Updated 2024-08-16: Towards Event-oriented Long Video Understanding
Authors: Yifan Du, Kun Zhou, Yuqi Huo, Yifan Li, Wayne Xin Zh
                        
Updated 2024-05-14: MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Authors: Bo He, Hengduo L
                        
Updated 2024-04-14: LongVLM: Efficient Long Video Understanding via Large Language Models
Authors: Yuetian Weng, Mingfei Han, Hao
                        
Updated 2024-04-06: LongVLM: Efficient Long Video Understanding via Large Language Models
Authors: Yuetian Weng, Mingfei Han, Hao
                        
Updated 2024-04-03: Instrument-tissue Interaction Detection Framework for Surgical Video Understanding
Authors: Wenjun Lin, Yan
                        
Updated 2024-04-01: A Unified Framework for Human-centric Point Cloud Video Understanding
Authors: Yiteng Xu, Kecheng Ye, Xiao Ha
                        
Updated 2024-03-31: VideoPrism: A Foundational Visual Encoder for Video Understanding
Authors: Long Zhao, Nitesh B. Gundavarapu,
                        
Updated 2024-01-18: CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding
Authors: Y
                        
Updated 2024-01-05: Video Understanding with Large Language Models: A Survey
Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan S

            