Abstract
The ability of robots to autonomously navigate through 3D environments depends on their comprehension of spatial concepts, ranging from low-level geometry to high-level semantics, such as objects, places, and buildings. To enable such comprehension, 3D scene graphs have emerged as a robust tool for representing the environment as a layered graph of concepts and their relationships. However, building these representations using monocular vision systems in real-time remains a difficult task that has not been explored in depth.
This paper puts forth a real-time spatial perception system Mono-Hydra, combining a monocular camera and an IMU sensor setup, focusing on indoor scenarios. However, the proposed approach is adaptable to outdoor applications, offering flexibility in its potential uses. The system employs a suite of deep learning algorithms to derive depth and semantics. It uses a robocentric visual-inertial odometry (VIO) algorithm based on square-root information, thereby ensuring consistent visual odometry with an IMU and a monocular camera. This system achieves sub-20 cm error in real-time processing at 15 fps, enabling real-time 3D scene graph construction using a laptop GPU (NVIDIA 3080). This enhances decision-making efficiency and effectiveness in simple camera setups, augmenting robotic system agility. We make Mono-Hydra publicly available at: https://github.com/UAV-Centre-ITC/Mono_Hydra.
This paper puts forth a real-time spatial perception system Mono-Hydra, combining a monocular camera and an IMU sensor setup, focusing on indoor scenarios. However, the proposed approach is adaptable to outdoor applications, offering flexibility in its potential uses. The system employs a suite of deep learning algorithms to derive depth and semantics. It uses a robocentric visual-inertial odometry (VIO) algorithm based on square-root information, thereby ensuring consistent visual odometry with an IMU and a monocular camera. This system achieves sub-20 cm error in real-time processing at 15 fps, enabling real-time 3D scene graph construction using a laptop GPU (NVIDIA 3080). This enhances decision-making efficiency and effectiveness in simple camera setups, augmenting robotic system agility. We make Mono-Hydra publicly available at: https://github.com/UAV-Centre-ITC/Mono_Hydra.
| Original language | English |
|---|---|
| Title of host publication | Geospatial Week 2023, Vol. 10-1 |
| Editors | N. El-Sheimy, A.A. Abdelbary, N. El-Bendary, Y. Mohasseb |
| Publisher | Copernicus |
| Pages | 439-445 |
| Number of pages | 7 |
| Volume | X-1/W1-2023 |
| DOIs | |
| Publication status | Published - 5 Dec 2023 |
| Event | 2023 ISPRS Geospatial Week, GSW 2023 - Intercontinental Cairo Semiramis, Cairo, Egypt Duration: 2 Sept 2023 → 7 Sept 2023 |
Publication series
| Name | ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences |
|---|---|
| Publisher | Copernicus |
| ISSN (Print) | 2194-9042 |
Conference
| Conference | 2023 ISPRS Geospatial Week, GSW 2023 |
|---|---|
| Abbreviated title | GSW |
| Country/Territory | Egypt |
| City | Cairo |
| Period | 2/09/23 → 7/09/23 |
Keywords
- 3D scene graphs
- Deep learning
- Real-time Mapping
- Spatial understanding
- Visual Inertial odometry
Fingerprint
Dive into the research topics of 'Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU'. Together they form a unique fingerprint.Research output
- 1 Citations
- 1 Preprint
-
Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU
Udugama, U. V. B. L., Vosselman, G. & Nex, F., 10 Aug 2023, ArXiv.org, 7 p.Research output: Working paper › Preprint › Academic
Open AccessFile223 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver