Abstract
The Internet has become an integral part of our daily lives. However, the essential task of
finding information is dominated by a handful of large centralised search engines. In this
thesis we study an alternative to this approach. Instead of using large data centres, we propose
using the machines that we all use every day: our desktop, laptop and tablet computers, to build
a peer-to-peer web search engine. We provide a definition of the associated research field: peer-to-peer
information retrieval. We examine what separates it from related fields, give an overview of
the work done so far and provide an economic perspective on peer-to-peer search. Furthermore,
we introduce our own architecture for peer-to-peer search systems, inspired by BitTorrent.
Distributing the task of providing search results for queries introduces the problem of query
routing: a query needs to be sent to a peer that can provide relevant search results. We investigate
how the content of peers can be represented so that queries can be directed to the best ones in
terms of relevance. While cooperative peers can provide their own representation, the content of
uncooperative peers can be accessed only through a search interface and thus they can not actively
provide a description of themselves. We look into representing these uncooperative peers by
probing their search interface to construct a representation. Finally, the capacity of the machines in
peer-to-peer networks differs considerably, making it challenging to provide search results quickly.
To address this, we present an approach where copies of search results for previous queries are
retained at peers and used to serve future requests and show participation can be incentivised
using reputations.
There are still problems to be solved before a real-world peer-to-peer web search engine can be
built. This thesis provides a starting point for this ambitious goal and also provides a solid basis
for reasoning about peer-to-peer information retrieval systems in general.
Original language | Undefined |
---|---|
Pages (from-to) | 116-116 |
Number of pages | 1 |
Journal | SIGIR forum |
Volume | 46 |
Issue number | 2 |
DOIs | |
Publication status | Published - Dec 2012 |
Keywords
- METIS-296328
- EWI-23123