Efficiency trade-offs in two-tier web search systems

R. Baeza-Yates, V. Murdock, C. Hauff

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Academic > peer-review

    30 Citations (Scopus)

    Abstract

    Search engines rely on searching multiple partitioned corpora to return results to users in a reasonable amount of time. In this paper we analyze the standard two-tier architecture for Web search with the difference that the corpus to be searched for a given query is predicted in advance. We show that any predictor better than random yields time savings, but this decrease in the processing time yields an increase in the infrastructure cost. We provide an analysis and investigate this trade-off in the context of two different scenarios on real-world data. We demonstrate that in general the decrease in answer time is justified by a small increase in infrastructure cost.
    Original language: Undefined
    Title of host publication: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
    Place of Publication: New York
    Publisher: Association for Computing Machinery (ACM)
    Pages: 163-170
    Number of pages: 8
    ISBN (Print): 978-1-60558-483-6
    Publication status: Published - 2009
    Event: 32nd Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009 - Boston, United States
    Duration: 19 Jul 2009 - 23 Jul 2009
    Conference number: 32

    Publication series

    Publisher: ACM

    Conference

    Conference: 32nd Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009
    Abbreviated title: SIGIR
    Country: United States
    City: Boston
    Period: 19/07/09 - 23/07/09

    Keywords

    • METIS-263975
    • CR-H.3.3
    • EWI-15901
    • IR-67849
