Abstract

Building discriminative representations for 3D data has been an important task in computer graphics and computer vision research. Convolutional Neural Networks (CNNs) have been shown to operate on 2D images with great success for a variety of tasks. Lifting convolution operators to 3D (3D CNNs) therefore seems like a plausible and promising next step. Unfortunately, the computational complexity of 3D CNNs grows cubically with respect to voxel resolution. Moreover, since most 3D geometry representations are boundary based, occupied regions do not grow proportionately with the size of the discretization, resulting in wasted computation. In this work, we represent 3D spaces as volumetric fields and propose a novel design that employs field probing filters to efficiently extract features from them. Each field probing filter is a set of probing points -- sensors that perceive the space. Our learning algorithm optimizes not only the weights associated with the probing points, but also their locations, which deforms the shape of the probing filters and adaptively distributes them in 3D space. The optimized probing points sense the 3D space intelligently, rather than operating blindly over the entire domain. We show that field probing is significantly more efficient than 3D CNNs, while providing state-of-the-art performance on classification tasks for 3D object recognition benchmark datasets.
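
To make the probing mechanism concrete, the sketch below (a minimal illustration, not the paper's implementation; all names, shapes, and hyperparameters are assumptions) computes the forward pass of a single field probing filter in Python/NumPy: it samples a scalar volumetric field at a set of continuous 3D probing-point locations via trilinear interpolation and takes a weighted sum of the samples. In the full method, both the point locations and the per-point weights would be optimized by backpropagation.

    import numpy as np

    def trilinear_sample(field, points):
        """Sample a volumetric field (D x H x W) at continuous 3D points.

        points: (N, 3) array of (x, y, z) coordinates in voxel units.
        Returns an (N,) array of trilinearly interpolated field values.
        """
        D, H, W = field.shape
        # Clamp so that the +1 neighbor stays inside the volume.
        p = np.clip(points, 0, np.array([W, H, D]) - 1.001)
        x, y, z = p[:, 0], p[:, 1], p[:, 2]
        x0 = np.floor(x).astype(int)
        y0 = np.floor(y).astype(int)
        z0 = np.floor(z).astype(int)
        x1, y1, z1 = x0 + 1, y0 + 1, z0 + 1
        fx, fy, fz = x - x0, y - y0, z - z0  # fractional offsets

        # Interpolate along x, then y, then z (field is indexed [z, y, x]).
        c00 = field[z0, y0, x0] * (1 - fx) + field[z0, y0, x1] * fx
        c01 = field[z1, y0, x0] * (1 - fx) + field[z1, y0, x1] * fx
        c10 = field[z0, y1, x0] * (1 - fx) + field[z0, y1, x1] * fx
        c11 = field[z1, y1, x0] * (1 - fx) + field[z1, y1, x1] * fx
        c0 = c00 * (1 - fy) + c10 * fy
        c1 = c01 * (1 - fy) + c11 * fy
        return c0 * (1 - fz) + c1 * fz

    # A single "field probing filter": learnable point locations and weights.
    rng = np.random.default_rng(0)
    resolution = 32
    field = rng.random((resolution, resolution, resolution))  # stand-in volumetric field
    locations = rng.uniform(0, resolution - 1, size=(8, 3))   # 8 probing points (learnable)
    weights = rng.standard_normal(8)                          # per-point weights (learnable)

    response = weights @ trilinear_sample(field, locations)   # scalar filter response
    print(response)

Because trilinear interpolation is differentiable with respect to the point coordinates, gradients can flow to the probing locations as well as to the weights; this is what allows the filters to deform and redistribute themselves in 3D space during training.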

Keywords

Computer science, Convolutional neural network, Artificial intelligence, Benchmark (surveying), Discriminative model, Field (mathematics), Convolution (computer science), Computer graphics, Pattern recognition (psychology), Computer vision, Voxel, Computation, Filter (signal processing), Set (abstract data type), Artificial neural network, Algorithm, Mathematics

Publication Info

Year: 2016
Type: article
Volume: 29
Pages: 307-315
Citations: 121
Access: Closed

Citation Metrics

121 (OpenAlex)

Cite This

Yangyan Li, Sören Pirk, Hao Su et al. (2016). FPNN: Field Probing Neural Networks for 3D Data. arXiv (Cornell University), 29, 307-315.