Timezone: »
Most existing 3D point cloud object detection approaches heavily rely on large amounts of labeled training data. However, the labeling process is costly and time-consuming. This paper considers few-shot 3D point cloud object detection, where only a few annotated samples of novel classes are needed with abundant samples of base classes. To this end, we propose Prototypical VoteNet to recognize and localize novel instances, which incorporates two new modules: Prototypical Vote Module (PVM) and Prototypical Head Module (PHM). Specifically, as the 3D basic geometric structures can be shared among categories, PVM is designed to leverage class-agnostic geometric prototypes, which are learned from base classes, to refine local features of novel categories. Then PHM is proposed to utilize class prototypes to enhance the global feature of each object, facilitating subsequent object localization and classification, which is trained by the episodic training strategy. To evaluate the model in this new setting, we contribute two new benchmark datasets, FS-ScanNet and FS-SUNRGBD. We conduct extensive experiments to demonstrate the effectiveness of Prototypical VoteNet, and our proposed method shows significant and consistent improvements compared to baselines on two benchmark datasets.
Author Information
Shizhen Zhao (The University of Hong Kong,)
ShiZhen Zhao is a first year PhD student at the University of Hong Kong. He received his undergraduate degree from Wuhan University of Technology and his master degree from Huazhong University of Science and Technology. His current research interests focus on few-shot object detection.
Xiaojuan Qi (The University of Hong Kong)
More from the Same Authors
-
2022 Poster: Unifying Voxel-based Representation with Transformer for 3D Object Detection »
Yanwei Li · Yilun Chen · Xiaojuan Qi · Zeming Li · Jian Sun · Jiaya Jia -
2022 Poster: Towards Efficient 3D Object Detection with Knowledge Distillation »
Jihan Yang · Shaoshuai Shi · Runyu Ding · Zhe Wang · Xiaojuan Qi -
2023 Poster: Data Pruning via Moving-one-Sample-out »
Haoru Tan · Sitong Wu · Fei Du · Yukang Chen · Zhibin Wang · Fan Wang · Xiaojuan Qi -
2023 Poster: CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation »
Xiuzhe Wu · Peng Dai · Weipeng DENG · Handi Chen · Yang Wu · Yan-Pei Cao · Ying Shan · Xiaojuan Qi -
2023 Poster: CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection »
Chuofan Ma · Yi Jiang · Xin Wen · Zehuan Yuan · Xiaojuan Qi -
2022 Poster: Spatial Pruned Sparse Convolution for Efficient 3D Object Detection »
Jianhui Liu · Yukang Chen · Xiaoqing Ye · Zhuotao Tian · Xiao Tan · Xiaojuan Qi -
2022 Poster: Self-Supervised Visual Representation Learning with Semantic Grouping »
Xin Wen · Bingchen Zhao · Anlin Zheng · Xiangyu Zhang · Xiaojuan Qi -
2022 Poster: Rethinking Resolution in the Context of Efficient Video Recognition »
Chuofan Ma · Qiushan Guo · Yi Jiang · Ping Luo · Zehuan Yuan · Xiaojuan Qi -
2020 Poster: Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation »
Bowen Li · Xiaojuan Qi · Philip Torr · Thomas Lukasiewicz