Abstract
Entity disambiguation is the task of mapping ambiguous terms in natural-language text to its entities in a knowledge base. It finds its application in the extraction of structured data in RDF (Resource Description Framework) from textual documents, but equally so in facilitating artificial intelligence applications, such as Seman-tic Search, Reasoning and Question & Answering. We propose a new collective, graph-based disambiguation algorithm utilizing semantic entity and document embeddings for robust entity disam-biguation. Robust thereby refers to the property of achieving better than state-of-the-art results over a wide range of very different data sets. Our approach is also able to abstain if no appropriate entity can be found for a specific surface form. Our evaluation shows, that our approach achieves significantly (>5%) better results than all other publicly available disambiguation algorithms on 7 of 9 datasets without data set specific tuning. Moreover, we discuss the influence of the quality of the knowledge base on the disambigua-tion accuracy and indicate that our algorithm achieves better results than non-publicly available state-of-the-art algorithms.
Original language | English |
---|---|
Title of host publication | SIGIR'16. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16 |
Place of Publication | New York |
Publisher | ACM Press |
Pages | 425-434 |
Number of pages | 10 |
ISBN (Print) | 9781450340694 |
DOIs | |
Publication status | Published - 2016 |
Event | 39th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016 - Palazzo dei Congressi di Pisa, Pisa, Italy Duration: 17 Jul 2016 → 21 Jul 2016 Conference number: 39 http://sigir.org/sigir2016/programp/ |
Conference
Conference | 39th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016 |
---|---|
Abbreviated title | SIGIR 2016 |
Country/Territory | Italy |
City | Pisa |
Period | 17/07/16 → 21/07/16 |
Internet address |