UAVPal: A New Dataset for Semantic Segmentation in Complex Urban Landscape with Efficient Multiscale Segmentation

Abhisek Maiti*, Sander Oude Elberink, George Vosselman

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

40 Downloads (Pure)

Abstract

Semantic segmentation has recently emerged as a prominent area of interest in Earth observation. Several semantic segmentation datasets already exist, facilitating comparisons among different methods in complex urban scenes. However, most open high-resolution urban datasets are geographically skewed toward Europe and North America, while coverage of Southeast Asia is very limited. The considerable variation in city designs worldwide presents an obstacle to the applicability of computer vision models, especially when the training dataset lacks significant diversity. On the other hand, naively applying computationally expensive models leads to inefficacies and sometimes poor performance. To tackle the lack of data diversity, we introduce a new UAVPal dataset of complex urban scenes from the city of Bhopal, India. We complement this by introducing a novel dense predictor head and demonstrate that a well-designed head can efficiently take advantage of the multiscale features to enhance the benefits of a strong feature extractor backbone. We design our segmentation head to learn the importance of features at various scales for each individual class and refine the final dense prediction accordingly. We tested our proposed head with a state-of-the-art backbone on multiple UAV datasets and a high-resolution satellite image dataset for LULC classification. We observed improved intersection over union (IoU) in various classes and up to 2$\%$ better mean IoU. Apart from the performance improvements, we also observed nearly 50$\%$ reduction in computing operations required when using the proposed head compared to the traditional segmentation head.

Original languageEnglish
Pages (from-to)464-475
Number of pages12
JournalIEEE Journal of selected topics in applied earth observations and remote sensing
Volume17
DOIs
Publication statusPublished - 2024

Keywords

  • CNN
  • deep learning
  • Earth observstion (EO)
  • semantic segmentation
  • vision transformer
  • ITC-GOLD
  • ITC-ISI-JOURNAL-ARTICLE

Fingerprint

Dive into the research topics of 'UAVPal: A New Dataset for Semantic Segmentation in Complex Urban Landscape with Efficient Multiscale Segmentation'. Together they form a unique fingerprint.

Cite this