Abstract
Grasping the roof structure of a building is a key part of building reconstruction. Directly predicting the geometric structure of the roof from a raster image to a vectorized representation, however, remains challenging. This paper introduces an efficient and accurate parsing method based upon a vision Transformer we dubbed Roof-Former. Our method consists of three steps: 1) Image encoder and edge node initialization, 2) Image feature fusion with an enhanced segmentation refinement branch, and 3) Edge filtering and structural reasoning. The vertex and edge heat map F1-scores have increased by 2.0% and 1.9% on the VWB dataset when compared to HEAT. Additionally, qualitative evaluations suggest that our method is superior to the current state-of-the-art. It indicates effectiveness for extracting global image information and maintaining the consistency and topological validity of the roof structure.
Original language | English |
---|---|
Title of host publication | IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings |
Publisher | IEEE |
Pages | 4899-4902 |
Number of pages | 4 |
ISBN (Electronic) | 9798350320107 |
DOIs | |
Publication status | Published - 2023 |
Event | 43rd IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 - Pasadena Convention Center, Pasadena, United States Duration: 16 Jul 2023 → 21 Jul 2023 Conference number: 43 https://2023.ieeeigarss.org/index.php |
Publication series
Name | International Geoscience and Remote Sensing Symposium (IGARSS) |
---|---|
Volume | 2023-July |
Conference
Conference | 43rd IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023 |
---|---|
Abbreviated title | IGARSS 2023 |
Country/Territory | United States |
City | Pasadena |
Period | 16/07/23 → 21/07/23 |
Internet address |
Keywords
- geometry reconstruction
- remote sensing image
- Roof structure extraction
- Transformer
- 2024 OA procedure