Recognition of Indoor Scenes using 3D Scene Graphs

Han Yue, Ville Lehtola, Hangbin Wu*, George Vosselman, Jincheng Li, Chun Liu

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

29 Downloads (Pure)

Abstract

Scene recognition is a fundamental task in 3-D scene understanding. It answers the question, 'What is this place?' In an indoor environment, the answer can be an office, kitchen, lobby, and so on. As the number of point clouds increases, using embedded point information in scene recognition becomes computationally heavy to process. To achieve computational efficiency and accurate classification, our idea is to use an indoor scene graph that represents the 3-D spatial structures via object instances. The proposed method comprises two parts, namely: 1) construction of indoor scene graphs leveraging object instances and their spatial relationships and 2) classification of these graphs using a deep learning network. Specifically, each indoor scene is represented by a graph, where each node represents either a structural element (like a ceiling, a wall, or a floor) or a piece of furniture (like a chair or a table), and each edge encodes the spatial relationship between these elements. Then, these graphs are used as input for our proposed graph classification network to learn different scene representations. The public indoor dataset, ScanNet v2, with 625.53 million points, is selected to test our method. Experiments yield good results with up to 88.00% accuracy and 82.30% F1 score in the fixed validation dataset and 90.46% accuracy and 81.45% F1 score in the ten-fold cross-validation method; moreover, if some indoor objects cannot be successfully identified, the scene classification accuracy depends sublinearly on the rate of missing objects in the scene.

Original languageEnglish
Article number5703216
Pages (from-to)1-16
Number of pages16
JournalIEEE transactions on geoscience and remote sensing
Volume62
Early online date11 Apr 2024
DOIs
Publication statusPublished - 2024

Keywords

  • Convolution
  • Deep learning
  • Feature extraction
  • Indoor
  • Instance segmentation
  • Point cloud compression
  • Semantics
  • Three-dimensional displays
  • graph classification
  • point clouds
  • scene graphs
  • scene recognition
  • ITC-ISI-JOURNAL-ARTICLE
  • 2024 OA procedure

Fingerprint

Dive into the research topics of 'Recognition of Indoor Scenes using 3D Scene Graphs'. Together they form a unique fingerprint.

Cite this