DoSeR - A knowledge-base-agnostic framework for entity disambiguation using semantic embeddings

Stefan Zwicklbauer, Christin Seifert, Michael Granitzer

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Academic > peer-review

    Abstract

    Entity disambiguation is the task of mapping ambiguous terms in natural-language text to their corresponding entities in a knowledge base. It is applied in the extraction of structured data in RDF (Resource Description Framework) from textual documents, and equally in facilitating artificial intelligence applications such as Semantic Search, Reasoning and Question Answering. In this work, we propose DoSeR (Disambiguation of Semantic Resources), a (named) entity disambiguation framework that is knowledge-base-agnostic in terms of RDF (e.g. DBpedia) and entity-annotated document knowledge bases (e.g. Wikipedia). Initially, our framework automatically generates semantic entity embeddings given one or multiple knowledge bases. Subsequently, DoSeR accepts documents with a given set of surface forms as input and collectively links them to entities in a knowledge base with a graph-based approach. We evaluate DoSeR on seven different data sets against publicly available, state-of-the-art (named) entity disambiguation frameworks. Our approach outperforms the state-of-the-art approaches that make use of RDF knowledge bases and/or entity-annotated document knowledge bases by up to 10% in F1 measure.
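
The two stages named in the abstract (entity embeddings, then collective graph-based linking) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the three-dimensional toy vectors, the entity identifiers, cosine similarity as edge weight, a PageRank-style random walk as the graph algorithm, and the hypothetical helper names `cosine` and `disambiguate`.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def disambiguate(candidates, embeddings, damping=0.85, iterations=50):
    """Pick one entity per surface form via a random walk on a candidate graph.

    candidates: dict mapping surface form -> list of candidate entity ids
    embeddings: dict mapping entity id -> vector (assumed to be precomputed)
    """
    # One graph node per (surface form, candidate entity) pair.
    nodes = [(sf, e) for sf, ents in candidates.items() for e in ents]
    n = len(nodes)

    # Edges connect only candidates of different surface forms and are
    # weighted by embedding similarity (negative similarities are clipped).
    weights = [[0.0] * n for _ in range(n)]
    for i, (sf_i, e_i) in enumerate(nodes):
        for j, (sf_j, e_j) in enumerate(nodes):
            if sf_i != sf_j:
                sim = cosine(embeddings[e_i], embeddings[e_j])
                weights[i][j] = max(sim, 0.0)
    out_sum = [sum(row) for row in weights]

    # Power iteration; nodes without outgoing weight pass on no score.
    scores = [1.0 / n] * n
    for _ in range(iterations):
        updated = []
        for j in range(n):
            incoming = sum(
                scores[i] * weights[i][j] / out_sum[i]
                for i in range(n)
                if out_sum[i] > 0
            )
            updated.append((1 - damping) / n + damping * incoming)
        scores = updated

    # Keep the highest-scoring candidate for each surface form.
    best = {}
    for (sf, e), score in zip(nodes, scores):
        if sf not in best or score > best[sf][1]:
            best[sf] = (e, score)
    return {sf: e for sf, (e, _) in best.items()}


# Toy data (hypothetical): axes loosely stand for automotive, wildlife, film.
toy_embeddings = {
    "Jaguar_Cars": [0.9, 0.1, 0.0],
    "Jaguar_(animal)": [0.0, 1.0, 0.0],
    "Ford_Motor_Company": [1.0, 0.0, 0.1],
    "Harrison_Ford": [0.0, 0.1, 1.0],
}
toy_candidates = {
    "Jaguar": ["Jaguar_Cars", "Jaguar_(animal)"],
    "Ford": ["Ford_Motor_Company", "Harrison_Ford"],
}
result = disambiguate(toy_candidates, toy_embeddings)
print(result)
```

In this toy setting the two automotive candidates reinforce each other, so both surface forms resolve to the car-related entities. This shows why linking surface forms collectively can help compared with resolving each one in isolation.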
    Original language: English
    Title of host publication: The Semantic Web. Latest Advances and New Domains
    Subtitle of host publication: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings
    Editors: Harald Sack, Eva Blomqvist, Mathieu d'Aquin, Chiara Ghidini, Simone Paolo Ponzetto, Christoph Lange
    Publisher: Springer
    Pages: 182-198
    Number of pages: 17
    ISBN (Electronic): 978-3-319-34129-3
    ISBN (Print): 978-3-319-34128-6
    Publication status: Published - 2016
    Event: 13th Extended Semantic Web Conference 2016 - Heraklion, Greece
    Duration: 29 May 2016 to 2 Jun 2016
    Conference number: 13

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 9678

    Conference

    Conference: 13th Extended Semantic Web Conference 2016
    Abbreviated title: ESWC 2016
    Country/Territory: Greece
    City: Heraklion
    Period: 29/05/16 to 2/06/16

    Keywords

    • Entity disambiguation
    • Linked Data
    • Neural networks
    • Semantic web
