TY - JOUR
T1 - Vectorizing planar roof structure from very high resolution remote sensing images using transformers
AU - Zhao, Wufan
AU - Persello, Claudio
AU - Lv, Xianwei
AU - Stein, Alfred
AU - Vergauwen, Maarten
N1 - Publisher Copyright:
© 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2024
Y1 - 2024
N2 - Accurately predicting the geometric structure of a building's roof as a vectorized representation from a raster image is a challenging task in building reconstruction. In this paper, we propose an efficient and precise parsing method called Roof-Former, based on a vision Transformer. Our method involves three steps: (1) Image encoder and edge node initialization, (2) Image feature fusion with an enhanced segmentation refinement branch, and (3) Edge filtering and structural reasoning. Our method outperforms previous works on the vectorizing world building dataset and the Enschede dataset, with vertex and edge heat map F1-scores increasing from (Formula presented.), (Formula presented.) to (Formula presented.), (Formula presented.), and from (Formula presented.), (Formula presented.) to (Formula presented.), (Formula presented.), respectively. Furthermore, our method demonstrates superior performance compared to the current state-of-the-art based on qualitative evaluations, indicating its effectiveness in extracting global image information while maintaining the consistency and topological validity of the roof structure.
AB - Accurately predicting the geometric structure of a building's roof as a vectorized representation from a raster image is a challenging task in building reconstruction. In this paper, we propose an efficient and precise parsing method called Roof-Former, based on a vision Transformer. Our method involves three steps: (1) Image encoder and edge node initialization, (2) Image feature fusion with an enhanced segmentation refinement branch, and (3) Edge filtering and structural reasoning. Our method outperforms previous works on the vectorizing world building dataset and the Enschede dataset, with vertex and edge heat map F1-scores increasing from (Formula presented.), (Formula presented.) to (Formula presented.), (Formula presented.), and from (Formula presented.), (Formula presented.) to (Formula presented.), (Formula presented.), respectively. Furthermore, our method demonstrates superior performance compared to the current state-of-the-art based on qualitative evaluations, indicating its effectiveness in extracting global image information while maintaining the consistency and topological validity of the roof structure.
KW - geometry reconstruction
KW - Roof structure extraction
KW - Transformer
KW - very high resolution remote sensing
KW - ITC-GOLD
KW - ITC-ISI-JOURNAL-ARTICLE
U2 - 10.1080/17538947.2023.2292637
DO - 10.1080/17538947.2023.2292637
M3 - Article
AN - SCOPUS:85180364501
SN - 1753-8947
VL - 17
SP - 1
EP - 15
JO - International journal of digital earth
JF - International journal of digital earth
IS - 1
ER -