Abstract
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierarchically organized in categories and strongly inter-linked. Its structure, its size and the fact that it is generated by humans are the reasons for the attention Wikipedia receives from researchers in different fields. Especially the link structure of Wikipedia is of huge importance not only for humans browsing the collection, but also as a resource for bootstrapping machine intelligence and the semantic web. Motivated by the fact that manual maintenance and creation of hyperlinks is labor intensive, this paper explores properties for automatic link creation between Wikipedia pages in this paper. Focusing on ad-hoc linking approaches we evaluate linking strategies on the word as well as on the document level using a standard test data set. As it is shown, rather simple approaches yield to reliable results and may be applicable in different application scenarios. Disambiguation strategies based on standard IR techniques help to boost accuracy delivering reasonable results.
| Original language | English |
|---|---|
| Title of host publication | WWW/Internet 2008 Proceedings |
| Subtitle of host publication | Proceedings of the IADIS International Conference on WWW/Internet, Freiburg, Germany, 13-15 October 2008 |
| Editors | Miguel Baptista Nunes, Pedro Isaías, Dirk Ifenthaler |
| Publisher | IADIS |
| Pages | 243-250 |
| Number of pages | 8 |
| ISBN (Print) | 978-972-8924-68-3 |
| Publication status | Published - 1 Oct 2008 |
| Externally published | Yes |
| Event | IADIS International Conference WWW/Internet 2008 - Freiburg, Germany Duration: 13 Oct 2008 → 15 Oct 2008 |
Conference
| Conference | IADIS International Conference WWW/Internet 2008 |
|---|---|
| Country/Territory | Germany |
| City | Freiburg |
| Period | 13/10/08 → 15/10/08 |