Abstract
Toponym extraction and disambiguation have received much attention in recent years. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. This paper addresses two problems with toponym extraction and disambiguation. First, almost no existing works examine the extraction and disambiguation interdependency. Second, existing disambiguation techniques mostly take as input extracted named entities without considering the uncertainty and imperfection of the extraction process. In this paper we aim to investigate both avenues and to show that explicit handling of the uncertainty of annotation has much potential for making both extraction and disambiguation more robust. We conducted experiments with a set of holiday home descriptions with the aim to extract and disambiguate toponyms. We show that the extraction confidence probabilities are useful in enhancing the effectiveness of disambiguation. Reciprocally, retraining the extraction models with information automatically derived from the disambiguation results, improves the extraction models. This mutual reinforcement is shown to even have an effect after several automatic iterations.
Original language | Undefined |
---|---|
Title of host publication | Knowledge Discovery, Knowledge Engineering and Knowledge Management: 4th International Joint Conference, IC3K 2012, Barcelona, Spain, October 4-7, 2012, Revised Selected Papers |
Editors | A. Fred, J.L.G. Dietz, K. Liu, J. Filipe |
Place of Publication | Berlin Heidelberg |
Publisher | Springer |
Pages | 113-129 |
Number of pages | 17 |
ISBN (Print) | 978-3-642-54104-9 |
DOIs | |
Publication status | Published - 2013 |
Event | 4th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2012 - Barcelona, Spain Duration: 4 Oct 2012 → 7 Oct 2012 Conference number: 4 |
Publication series
Name | Communications in Computer and Information Science |
---|---|
Publisher | Springer Verlag |
Number | 415 |
Volume | 415 |
ISSN (Print) | 1865-0929 |
ISSN (Electronic) | 1865-0937 |
Conference
Conference | 4th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2012 |
---|---|
Abbreviated title | IC3K |
Country/Territory | Spain |
City | Barcelona |
Period | 4/10/12 → 7/10/12 |
Keywords
- Toponym RecognitionToponym ExtractionToponym DisambiguationToponym LinkingUncertain Annotations
- EWI-24610
- Uncertain Annotations
- IR-90489
- Toponyms Extraction
- METIS-304041
- Toponym Disambiguation