Abstract
Scene recognition is a fundamental task in 3-D scene understanding. It answers the question, 'What is this place?' In an indoor environment, the answer can be an office, kitchen, lobby, and so on. As the number of point clouds increases, using embedded point information in scene recognition becomes computationally heavy to process. To achieve computational efficiency and accurate classification, our idea is to use an indoor scene graph that represents the 3-D spatial structures via object instances. The proposed method comprises two parts, namely: 1) construction of indoor scene graphs leveraging object instances and their spatial relationships and 2) classification of these graphs using a deep learning network. Specifically, each indoor scene is represented by a graph, where each node represents either a structural element (like a ceiling, a wall, or a floor) or a piece of furniture (like a chair or a table), and each edge encodes the spatial relationship between these elements. Then, these graphs are used as input for our proposed graph classification network to learn different scene representations. The public indoor dataset, ScanNet v2, with 625.53 million points, is selected to test our method. Experiments yield good results with up to 88.00% accuracy and 82.30% F1 score in the fixed validation dataset and 90.46% accuracy and 81.45% F1 score in the ten-fold cross-validation method; moreover, if some indoor objects cannot be successfully identified, the scene classification accuracy depends sublinearly on the rate of missing objects in the scene.
Original language | English |
---|---|
Article number | 5703216 |
Pages (from-to) | 1-16 |
Number of pages | 16 |
Journal | IEEE transactions on geoscience and remote sensing |
Volume | 62 |
Early online date | 11 Apr 2024 |
DOIs | |
Publication status | Published - 2024 |
Keywords
- Convolution
- Deep learning
- Feature extraction
- Indoor
- Instance segmentation
- Point cloud compression
- Semantics
- Three-dimensional displays
- graph classification
- point clouds
- scene graphs
- scene recognition
- ITC-ISI-JOURNAL-ARTICLE
- 2024 OA procedure