Abstract
This paper describes a pilot study that implements a novel approach to validate data mining tasks by using the crowd to train a classifier. This hybrid approach to processing successfully addresses challenges faced during human curation or machine processing of user-generated geographic content (UGGC), namely quality control, reproducibility, sustainability, scaling, data quality, overfitting, and training costs. We test the approach on mining UGGC to derive information on local places as humans perceive them. Specifically, we retrieve Flickr image metadata, enrich it semantically by building term vectors using a controlled vocabulary, cluster it spatially, let online participants rate those clusters, classify them into noise and places by using both semantic and cluster characteristics, let online participants supervise the classification by annotating the results, and use their feedback to improve clustering and revise the trained model. The results show that the approach is feasible and suggest future studies to improve it, while also indicating that mining places from UGGC requires more than a single source.
Original language | English |
---|---|
Title of host publication | Societal Geo-Innovation |
Subtitle of host publication | short papers, posters and poster abstracts of the 20th AGILE Conference on Geographic Information Science, 9-12 May 2017, Wageningen, the Netherlands |
Editors | A. Bergt, T. Sarjakoski, R. van Lammeren, F. Rip |
Place of Publication | Wageningen |
Publisher | Wageningen University & Research Centre |
Number of pages | 5 |
ISBN (Print) | 978-90-816960-7-4 |
Publication status | Published - 2017 |
Event | 20th AGILE Conference on Geographic Information Science, AGILE 2017 - Wageningen, Netherlands Duration: 9 May 2017 → 12 May 2017 Conference number: 20 https://agile-online.org/index.php/conference/proceedings/proceedings-2017 |
Conference
Conference | 20th AGILE Conference on Geographic Information Science, AGILE 2017 |
---|---|
Abbreviated title | AGILE |
Country/Territory | Netherlands |
City | Wageningen |
Period | 9/05/17 → 12/05/17 |
Internet address |