Robust and Collective Entity Disambiguation through Semantic Embeddings

Stefan Zwicklbauer, Christin Seifert, Michael Granitzer

    Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

    78 Citations (Scopus)
    46 Downloads (Pure)

    Abstract

    Entity disambiguation is the task of mapping ambiguous terms in natural-language text to its entities in a knowledge base. It finds its application in the extraction of structured data in RDF (Resource Description Framework) from textual documents, but equally so in facilitating artificial intelligence applications, such as Seman-tic Search, Reasoning and Question & Answering. We propose a new collective, graph-based disambiguation algorithm utilizing semantic entity and document embeddings for robust entity disam-biguation. Robust thereby refers to the property of achieving better than state-of-the-art results over a wide range of very different data sets. Our approach is also able to abstain if no appropriate entity can be found for a specific surface form. Our evaluation shows, that our approach achieves significantly (>5%) better results than all other publicly available disambiguation algorithms on 7 of 9 datasets without data set specific tuning. Moreover, we discuss the influence of the quality of the knowledge base on the disambigua-tion accuracy and indicate that our algorithm achieves better results than non-publicly available state-of-the-art algorithms.
    Original languageEnglish
    Title of host publicationSIGIR'16. Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16
    Place of PublicationNew York
    PublisherACM Press
    Pages425-434
    Number of pages10
    ISBN (Print)9781450340694
    DOIs
    Publication statusPublished - 2016
    Event39th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016 - Palazzo dei Congressi di Pisa, Pisa, Italy
    Duration: 17 Jul 201621 Jul 2016
    Conference number: 39
    http://sigir.org/sigir2016/programp/

    Conference

    Conference39th International ACM SIGIR Conference on Research and Development in Information Retrieval 2016
    Abbreviated titleSIGIR 2016
    Country/TerritoryItaly
    CityPisa
    Period17/07/1621/07/16
    Internet address

    Fingerprint

    Dive into the research topics of 'Robust and Collective Entity Disambiguation through Semantic Embeddings'. Together they form a unique fingerprint.

    Cite this