视频理解

发布日期: 2022-12-21

2022-12-21 更新

Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding

Authors:Hao Wen, Yunze Liu, Jingwei Huang, Bo Duan, Li Yi

This paper proposes a 4D backbone for long-term point cloud video understanding. A typical way to capture spatial-temporal context is using 4Dconv or transformer without hierarchy. However, those methods are neither effective nor efficient enough due to camera motion, scene changes, sampling patterns, and the complexity of 4D data. To address those issues, we leverage the primitive plane as a mid-level representation to capture the long-term spatial-temporal context in 4D point cloud videos and propose a novel hierarchical backbone named Point Primitive Transformer(PPTr), which is mainly composed of intra-primitive point transformers and primitive transformers. Extensive experiments show that PPTr outperforms the previous state of the arts on different tasks.
PDF

点此查看论文截图

木子已

https://ipaper.today/2022/12/21/2022-12-21-shi-pin-li-jie/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源木子已 !

视频理解

Vision Transformer

2022-12-21 Vision Transformer

Vision Transformer

I2I Translation

2022-12-21 I2I Translation

I2I Translation

视频理解

2022-12-21 更新

Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding

打赏用于支持本站流量费