Brute Force Information Retrieval Experiments using MapReduce

Djoerd Hiemstra, C. Hauff

Research output: Contribution to journalArticleProfessional

32 Downloads (Pure)

Abstract

MIREX (MapReduce Information Retrieval Experiments) is a software library initially developed by the Database Group of the University of Twente for running large scale information retrieval experiments on clusters of machines. MIREX has been tested on web crawls of up to half a billion web pages, totalling about 12.5 TB of data uncompressed. MIREX shows that the execution of test queries by a brute force linear scan of pages, is a viable alternative to running the test queries on a search engine’s inverted index. MIREX is open source and available for others.
Original languageUndefined
Pages (from-to)31-32
Number of pages2
JournalERCIM news
Volume2012
Issue number89
Publication statusPublished - Apr 2012

Keywords

  • EWI-21745
  • IR-80117
  • METIS-286315

Cite this