Evaluation of Automatic Linking Strategies for Wikipedia Pages

Michael Granitzer, Mario Zechner, Christin Seifert, Josef Kolbitsch, Peter Kemper, Ronald In`t Velt

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    10 Downloads (Pure)


    Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierarchically organized in categories and strongly inter-linked. Its structure, its size and the fact that it is generated by humans are the reasons for the attention Wikipedia receives from researchers in different fields. Especially the link structure of Wikipedia is of huge importance not only for humans browsing the collection, but also as a resource for bootstrapping machine intelligence and the semantic web. Motivated by the fact that manual maintenance and creation of hyperlinks is labor intensive, this paper explores properties for automatic link creation between Wikipedia pages in this paper. Focusing on ad-hoc linking approaches we evaluate linking strategies on the word as well as on the document level using a standard test data set. As it is shown, rather simple approaches yield to reliable results and may be applicable in different application scenarios. Disambiguation strategies based on standard IR techniques help to boost accuracy delivering reasonable results.
    Original languageEnglish
    Title of host publicationWWW/Internet 2008 Proceedings
    Subtitle of host publicationProceedings of the IADIS International Conference on WWW/Internet, Freiburg, Germany, 13-15 October 2008
    EditorsMiguel Baptista Nunes, Pedro Isaías, Dirk Ifenthaler
    Number of pages8
    ISBN (Print)978-972-8924-68-3
    Publication statusPublished - 1 Oct 2008
    EventIADIS International Conference WWW/Internet 2008 - Freiburg, Germany
    Duration: 13 Oct 200815 Oct 2008


    ConferenceIADIS International Conference WWW/Internet 2008

    Fingerprint Dive into the research topics of 'Evaluation of Automatic Linking Strategies for Wikipedia Pages'. Together they form a unique fingerprint.

    Cite this