MapReduce for Experimental Search

Djoerd Hiemstra, C. Hauff

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This report presents preliminary results for the TREC 2010 ad-hoc web search task. We ran our MIREX system on 0.5 billion web documents from the ClueWeb09 crawl. On average, the system retrieves at least 3 relevant documents on the first result page containing 10 results, using a simple index consisting of anchor texts, page titles, and spam removal.
LanguageUndefined
Title of host publicationProceedings of the Nineteenth Text REtrieval Conference (TREC 2010)
EditorsE.M Voorhees, L.P. Buckland
Place of PublicationGaithersburg, Maryland, USA
PublisherNational Institute of Standards and Technology (NIST)
Pages51
Number of pages5
ISBN (Print)not assigned
StatePublished - 23 Feb 2011
EventNineteenth Text REtrieval Conference, TREC-19 2010 - Gaithersburg, United States
Duration: 16 Nov 201019 Nov 2010
Conference number: 19

Publication series

NameNIST Special Publications
PublisherNational Institute of Standards and Technology (NIST)

Conference

ConferenceNineteenth Text REtrieval Conference, TREC-19 2010
Abbreviated titleTREC
CountryUnited States
CityGaithersburg
Period16/11/1019/11/10

Keywords

  • METIS-277578
  • EWI-19796
  • IR-76391

Cite this

Hiemstra, D., & Hauff, C. (2011). MapReduce for Experimental Search. In E. M. Voorhees, & L. P. Buckland (Eds.), Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010) (pp. 51). (NIST Special Publications). Gaithersburg, Maryland, USA: National Institute of Standards and Technology (NIST).
Hiemstra, Djoerd ; Hauff, C./ MapReduce for Experimental Search. Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010). editor / E.M Voorhees ; L.P. Buckland. Gaithersburg, Maryland, USA : National Institute of Standards and Technology (NIST), 2011. pp. 51 (NIST Special Publications).
@inproceedings{4df5eca298aa4ef39c63a90d176b4a7f,
title = "MapReduce for Experimental Search",
abstract = "This report presents preliminary results for the TREC 2010 ad-hoc web search task. We ran our MIREX system on 0.5 billion web documents from the ClueWeb09 crawl. On average, the system retrieves at least 3 relevant documents on the first result page containing 10 results, using a simple index consisting of anchor texts, page titles, and spam removal.",
keywords = "METIS-277578, EWI-19796, IR-76391",
author = "Djoerd Hiemstra and C. Hauff",
year = "2011",
month = "2",
day = "23",
language = "Undefined",
isbn = "not assigned",
series = "NIST Special Publications",
publisher = "National Institute of Standards and Technology (NIST)",
pages = "51",
editor = "E.M Voorhees and L.P. Buckland",
booktitle = "Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010)",

}

Hiemstra, D & Hauff, C 2011, MapReduce for Experimental Search. in EM Voorhees & LP Buckland (eds), Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010). NIST Special Publications, National Institute of Standards and Technology (NIST), Gaithersburg, Maryland, USA, pp. 51, Nineteenth Text REtrieval Conference, TREC-19 2010, Gaithersburg, United States, 16/11/10.

MapReduce for Experimental Search. / Hiemstra, Djoerd; Hauff, C.

Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010). ed. / E.M Voorhees; L.P. Buckland. Gaithersburg, Maryland, USA : National Institute of Standards and Technology (NIST), 2011. p. 51 (NIST Special Publications).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - MapReduce for Experimental Search

AU - Hiemstra,Djoerd

AU - Hauff,C.

PY - 2011/2/23

Y1 - 2011/2/23

N2 - This report presents preliminary results for the TREC 2010 ad-hoc web search task. We ran our MIREX system on 0.5 billion web documents from the ClueWeb09 crawl. On average, the system retrieves at least 3 relevant documents on the first result page containing 10 results, using a simple index consisting of anchor texts, page titles, and spam removal.

AB - This report presents preliminary results for the TREC 2010 ad-hoc web search task. We ran our MIREX system on 0.5 billion web documents from the ClueWeb09 crawl. On average, the system retrieves at least 3 relevant documents on the first result page containing 10 results, using a simple index consisting of anchor texts, page titles, and spam removal.

KW - METIS-277578

KW - EWI-19796

KW - IR-76391

M3 - Conference contribution

SN - not assigned

T3 - NIST Special Publications

SP - 51

BT - Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010)

PB - National Institute of Standards and Technology (NIST)

CY - Gaithersburg, Maryland, USA

ER -

Hiemstra D, Hauff C. MapReduce for Experimental Search. In Voorhees EM, Buckland LP, editors, Proceedings of the Nineteenth Text REtrieval Conference (TREC 2010). Gaithersburg, Maryland, USA: National Institute of Standards and Technology (NIST). 2011. p. 51. (NIST Special Publications).