TY - JOUR
T1 - Identifying a Slums’ Degree of Deprivation from VHR Images Using Convolutional Neural Networks
AU - Ajami, Alireza
AU - Kuffer, M.
AU - Persello, C.
AU - Pfeffer, K.
N1 - Funding Information:
Funding: We would like to acknowledge the support of the SimCity project (contract number: C.2324.0293) for data collection and the support from the NWO/Netherlands eScience Center funded project DynaSlum—Data Driven Modelling and Decision Support for Slums—under the contract number 27015G05.
Funding Information:
Acknowledgments: We would like to express our great appreciation to Chloe Pottinger-Glass who carried out the fieldwork and collected the data used to build the QS index; this also includes the photographs used in this article. We would like to acknowledge the support of the SimCity project (contract number: C.2324.0293) and the support from the NWO/Netherlands eScience Center funded project DynaSlum—Data Driven Modelling and Decision Support for Slums—under the contract number 27015G05. We wish to offer our special thanks to Champaka Rajagopal for their help in providing a deeper local insight into the available data and setting up the QS fieldwork. We are particularly grateful to Debraj Roy and Mike Lees from the University of Amsterdam (DynaSlum project leaders) and Berend Weel from eScience Center for granting access to the data acquired within DynaSlum.
Publisher Copyright:
© 2019 by the authors.
PY - 2019/6/1
Y1 - 2019/6/1
N2 - In the cities of the Global South, slum settlements are growing in size and number, but their locations and characteristics are often missing in official statistics and maps. Although several studies have focused on detecting slums from satellite images, only a few captured their variations. This study addresses this gap using an integrated approach that can identify a slums’ degree of deprivation in terms of socio-economic variability in Bangalore, India using image features derived from very high resolution (VHR) satellite images. To characterize deprivation, we use multiple correspondence analysis (MCA) and quantify deprivation with a data-driven index of multiple deprivation (DIMD). We take advantage of spatial features learned by a convolutional neural network (CNN) from VHR satellite images to predict the DIMD. To deal with a small training dataset of only 121 samples with known DIMD values, insufficient to train a deep CNN, we conduct a two-step transfer learning approach using 1461 delineated slum boundaries as follows. First, a CNN is trained using these samples to classify slums and formal areas. The trained network is then fine-tuned using the 121 samples to directly predict the DIMD. The best prediction is obtained by using an ensemble non-linear regression model, combining the results of the CNN and models based on hand-crafted and geographic information system (GIS) features, with R2 of 0.75. Our findings show that using the proposed two-step transfer learning approach, a deep CNN can be trained with a limited number of samples to predict the slums’ degree of deprivation. This demonstrates that the CNN-based approach can capture variations of deprivation in VHR images, providing a comprehensive understanding of the socio-economic situation of slums in Bangalore.
AB - In the cities of the Global South, slum settlements are growing in size and number, but their locations and characteristics are often missing in official statistics and maps. Although several studies have focused on detecting slums from satellite images, only a few captured their variations. This study addresses this gap using an integrated approach that can identify a slums’ degree of deprivation in terms of socio-economic variability in Bangalore, India using image features derived from very high resolution (VHR) satellite images. To characterize deprivation, we use multiple correspondence analysis (MCA) and quantify deprivation with a data-driven index of multiple deprivation (DIMD). We take advantage of spatial features learned by a convolutional neural network (CNN) from VHR satellite images to predict the DIMD. To deal with a small training dataset of only 121 samples with known DIMD values, insufficient to train a deep CNN, we conduct a two-step transfer learning approach using 1461 delineated slum boundaries as follows. First, a CNN is trained using these samples to classify slums and formal areas. The trained network is then fine-tuned using the 121 samples to directly predict the DIMD. The best prediction is obtained by using an ensemble non-linear regression model, combining the results of the CNN and models based on hand-crafted and geographic information system (GIS) features, with R2 of 0.75. Our findings show that using the proposed two-step transfer learning approach, a deep CNN can be trained with a limited number of samples to predict the slums’ degree of deprivation. This demonstrates that the CNN-based approach can capture variations of deprivation in VHR images, providing a comprehensive understanding of the socio-economic situation of slums in Bangalore.
KW - ITC-GOLD
KW - ITC-ISI-JOURNAL-ARTICLE
UR - https://ezproxy2.utwente.nl/login?url=https://library.itc.utwente.nl/login/2019/isi/kuffer_ide.pdf
U2 - 10.3390/rs11111282
DO - 10.3390/rs11111282
M3 - Article
SN - 2072-4292
VL - 11
SP - 1
EP - 24
JO - Remote sensing
JF - Remote sensing
IS - 11
M1 - 1282
ER -