DoSeR - A knowledge-base-agnostic framework for entity disambiguation using semantic embeddings

Stefan Zwicklbauer, Christin Seifert, Michael Granitzer

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Academic > peer-review

34 Citations (Scopus)
21 Downloads (Pure)

Abstract

Entity disambiguation is the task of mapping ambiguous terms in natural-language text to their corresponding entities in a knowledge base. It is applied in the extraction of structured data in RDF (Resource Description Framework) from textual documents, and equally in supporting artificial intelligence applications such as semantic search, reasoning, and question answering. In this work, we propose DoSeR (Disambiguation of Semantic Resources), a (named) entity disambiguation framework that is knowledge-base-agnostic with respect to RDF knowledge bases (e.g. DBpedia) and entity-annotated document knowledge bases (e.g. Wikipedia). First, our framework automatically generates semantic entity embeddings from one or multiple knowledge bases. Then, DoSeR accepts documents with a given set of surface forms as input and collectively links these surface forms to entities in a knowledge base using a graph-based approach. We evaluate DoSeR on seven different data sets against publicly available, state-of-the-art (named) entity disambiguation frameworks. Our approach outperforms the state-of-the-art approaches that make use of RDF knowledge bases and/or entity-annotated document knowledge bases by up to 10 % in F1 measure.
Original language: English
Title of host publication: The Semantic Web. Latest Advances and New Domains
Subtitle of host publication: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 – June 2, 2016, Proceedings
Editors: Harald Sack, Eva Blomqvist, Mathieu d'Aquin, Chiara Ghidini, Simone Paolo Ponzetto, Christoph Lange
Publisher: Springer
Pages: 182-198
Number of pages: 17
ISBN (Electronic): 978-3-319-34129-3
ISBN (Print): 978-3-319-34128-6
Publication status: Published - 2016
Externally published: Yes
Event: 13th Extended Semantic Web Conference 2016, Heraklion, Greece
Duration: 29 May 2016 to 2 Jun 2016
Conference number: 13

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer
Volume: 9678
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 13th Extended Semantic Web Conference 2016
Abbreviated title: ESWC 2016
Country/Territory: Greece
City: Heraklion
Period: 29/05/16 to 2/06/16

Keywords

  • Entity disambiguation
  • Linked Data
  • Neural networks
  • Semantic web
  • n/a OA procedure
