TY - JOUR
T1 - Land cover classification from remote sensing images based on multi-scale fully convolutional network
AU - Li, Rui
AU - Zheng, Shunyi
AU - Duan, Chenxi
AU - Wang, Libo
AU - Zhang, Ce
N1 - Funding Information:
This work is supported by the National Natural Science Foundation of China [grant number 41671452].
Publisher Copyright:
© 2022 Wuhan University. Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2022/4/3
Y1 - 2022/4/3
N2 - Although the Convolutional Neural Network (CNN) has shown great potential for land cover classification, the frequently used single-scale convolution kernel limits the scope of information extraction. Therefore, we propose a Multi-Scale Fully Convolutional Network (MSFCN) with a multi-scale convolutional kernel as well as a Channel Attention Block (CAB) and a Global Pooling Module (GPM) in this paper to exploit discriminative representations from two-dimensional (2D) satellite images. Meanwhile, to explore the ability of the proposed MSFCN for spatio-temporal images, we expand our MSFCN to three-dimension using three-dimensional (3D) CNN, capable of harnessing each land cover category’s time series interaction from the reshaped spatio-temporal remote sensing images. To verify the effectiveness of the proposed MSFCN, we conduct experiments on two spatial datasets and two spatio-temporal datasets. The proposed MSFCN achieves 60.366% on the WHDLD dataset and 75.127% on the GID dataset in terms of mIoU index while the figures for two spatio-temporal datasets are 87.753% and 77.156%. Extensive comparative experiments and ablation studies demonstrate the effectiveness of the proposed MSFCN. Code will be available at https://github.com/lironui/MSFCN.
AB - Although the Convolutional Neural Network (CNN) has shown great potential for land cover classification, the frequently used single-scale convolution kernel limits the scope of information extraction. Therefore, we propose a Multi-Scale Fully Convolutional Network (MSFCN) with a multi-scale convolutional kernel as well as a Channel Attention Block (CAB) and a Global Pooling Module (GPM) in this paper to exploit discriminative representations from two-dimensional (2D) satellite images. Meanwhile, to explore the ability of the proposed MSFCN for spatio-temporal images, we expand our MSFCN to three-dimension using three-dimensional (3D) CNN, capable of harnessing each land cover category’s time series interaction from the reshaped spatio-temporal remote sensing images. To verify the effectiveness of the proposed MSFCN, we conduct experiments on two spatial datasets and two spatio-temporal datasets. The proposed MSFCN achieves 60.366% on the WHDLD dataset and 75.127% on the GID dataset in terms of mIoU index while the figures for two spatio-temporal datasets are 87.753% and 77.156%. Extensive comparative experiments and ablation studies demonstrate the effectiveness of the proposed MSFCN. Code will be available at https://github.com/lironui/MSFCN.
KW - land cover classification
KW - Multi-Scale Fully Convolutional Network
KW - Spatio-temporal remote sensing images
KW - ITC-GOLD
KW - ITC-ISI-JOURNAL-ARTICLE
KW - UT-Gold-D
UR - https://ezproxy2.utwente.nl/login?url=https://library.itc.utwente.nl/login/2022/isi/duan_lan.pdf
U2 - 10.1080/10095020.2021.2017237
DO - 10.1080/10095020.2021.2017237
M3 - Article
AN - SCOPUS:85122422997
VL - 25
SP - 278
EP - 294
JO - Geo-spatial information science
JF - Geo-spatial information science
SN - 1009-5020
IS - 2
ER -