Timezone: »
3D scenes are dominated by a large number of background points, which is redundant for the detection task that mainly needs to focus on foreground objects. In this paper, we analyze major components of existing sparse 3D CNNs and find that 3D CNNs ignores the redundancy of data and further amplifies it in the down-sampling process, which brings a huge amount of extra and unnecessary computational overhead. Inspired by this, we propose a new convolution operator named spatial pruned sparse convolution (SPS-Conv), which includes two variants, spatial pruned submanifold sparse convolution (SPSS-Conv) and spatial pruned regular sparse convolution (SPRS-Conv), both of which are based on the idea of dynamically determine crucial areas for performing computations to reduce redundancy. We empirically find that magnitude of features can serve as an important cues to determine crucial areas which get rid of the heavy computations of learning-based methods. The proposed modules can easily be incorporated into existing sparse 3D CNNs without extra architectural modifications. Extensive experiments on the KITTI and nuScenes datasets demonstrate that our method can achieve more than 50% reduction in GFLOPs without compromising the performance.
Author Information
Jianhui Liu (The University of Hong Kong)
Yukang Chen (The Chinese University of Hong Kong)
Xiaoqing Ye (Baidu)
Zhuotao Tian (The Chinese University of Hong Kong)
Xiao Tan (Baidu Inc.)
Xiaojuan Qi (The University of Hong Kong)
More from the Same Authors
-
2022 Poster: Unifying Voxel-based Representation with Transformer for 3D Object Detection »
Yanwei Li · Yilun Chen · Xiaojuan Qi · Zeming Li · Jian Sun · Jiaya Jia -
2022 Poster: Towards Efficient 3D Object Detection with Knowledge Distillation »
Jihan Yang · Shaoshuai Shi · Runyu Ding · Zhe Wang · Xiaojuan Qi -
2023 Poster: Data Pruning via Moving-one-Sample-out »
Haoru Tan · Sitong Wu · Fei Du · Yukang Chen · Zhibin Wang · Fan Wang · Xiaojuan Qi -
2023 Poster: CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation »
Xiuzhe Wu · Peng Dai · Weipeng DENG · Handi Chen · Yang Wu · Yan-Pei Cao · Ying Shan · Xiaojuan Qi -
2023 Poster: Query-based Temporal Fusion with Explicit Motion for 3D Object Detection »
Jinghua Hou · Zhe Liu · dingkang liang · Zhikang Zou · Xiaoqing Ye · Xiang Bai -
2023 Poster: CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection »
Chuofan Ma · Yi Jiang · Xin Wen · Zehuan Yuan · Xiaojuan Qi -
2022 Poster: Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection »
Shizhen Zhao · Xiaojuan Qi -
2022 Poster: Self-Supervised Visual Representation Learning with Semantic Grouping »
Xin Wen · Bingchen Zhao · Anlin Zheng · Xiangyu Zhang · Xiaojuan Qi -
2022 Poster: Rethinking Resolution in the Context of Efficient Video Recognition »
Chuofan Ma · Qiushan Guo · Yi Jiang · Ping Luo · Zehuan Yuan · Xiaojuan Qi -
2020 Poster: Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation »
Bowen Li · Xiaojuan Qi · Philip Torr · Thomas Lukasiewicz -
2020 Poster: Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching »
Di Hu · Rui Qian · Minyue Jiang · Xiao Tan · Shilei Wen · Errui Ding · Weiyao Lin · Dejing Dou