TY - JOUR
T1 - Benford’s law and geographical information–the example of OpenStreetMap
AU - Mocnik, F.-B.
N1 - Funding Information:
This work has partly been supported by the Heidelberg Academy of Sciences and Humanities project Heterogeneity and convergence in shared data sources?the importance of cognitive coherence in collective decision making.
Publisher Copyright:
© 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2021/9/2
Y1 - 2021/9/2
N2 - Few laws about geographical information are known, partly because geographical information is inherently complex. Tobler’s first law of Geography and, to a lesser degree, also his second law are among the rare exceptions. In this article, we explore the validity of Benford’s law in the context of the example of OpenStreetMap. More specifically, we compare the distribution of several numerical features of geographical entities to the Benford distribution. It is demonstrated that the numerical features examined are in accordance with Benford’s law to a varying degree with little variation between the types of geographical entities. Spatial patterns in the deviation from Benford’s law are shown to be similar for some aspects but to strongly differ for other ones. We show that many aspects of the data tend to deviate more than average from the Benford distribution in Africa, Greenland, smaller island countries, and, to a lesser degree, in South America. Also, the scale-dependency of Benford’s law is explored. Motivated by the use of Benford’s law to detect indications for fraud in economic and other datasets, future prospects and limitations to systematically develop intrinsic data quality measures are discussed.
AB - Few laws about geographical information are known, partly because geographical information is inherently complex. Tobler’s first law of Geography and, to a lesser degree, also his second law are among the rare exceptions. In this article, we explore the validity of Benford’s law in the context of the example of OpenStreetMap. More specifically, we compare the distribution of several numerical features of geographical entities to the Benford distribution. It is demonstrated that the numerical features examined are in accordance with Benford’s law to a varying degree with little variation between the types of geographical entities. Spatial patterns in the deviation from Benford’s law are shown to be similar for some aspects but to strongly differ for other ones. We show that many aspects of the data tend to deviate more than average from the Benford distribution in Africa, Greenland, smaller island countries, and, to a lesser degree, in South America. Also, the scale-dependency of Benford’s law is explored. Motivated by the use of Benford’s law to detect indications for fraud in economic and other datasets, future prospects and limitations to systematically develop intrinsic data quality measures are discussed.
KW - Benford’s law
KW - data quality
KW - Geographical Shared Data Sources (GSDS)
KW - OpenStreetMap (OSM)
KW - Volunteered Geographic Information (VGI)
KW - ITC-ISI-JOURNAL-ARTICLE
KW - ITC-HYBRID
KW - UT-Hybrid-D
UR - https://ezproxy2.utwente.nl/login?url=https://library.itc.utwente.nl/login/2021/isi/mocnik_ben.pdf
U2 - 10.1080/13658816.2020.1829627
DO - 10.1080/13658816.2020.1829627
M3 - Article
AN - SCOPUS:85103914901
SN - 1365-8816
VL - 35
SP - 1746
EP - 1772
JO - International journal of geographical information science
JF - International journal of geographical information science
IS - 9
ER -