Abstract
The efficiency of neural information retrieval methods is primarily evaluated by measuring query latency. In practice, measuring latency is highly tied to hardware configurations and requires extensive computational resources. Given the rapid introduction of retrieval models, achieving an overall comparison of their efficiency is challenging. In this paper, we introduce PEIR, a framework for hardware-independent efficiency measurements in Learned Sparse Retrieval (LSR). By employing performance modeling approaches from high-performance computing, we derive performance models for query evaluation approaches such as BlockMax-MaxScore (BMM) and propose to measure memory and/or floating-point operations while performing retrieval on input queries. We demonstrate that by using PEIR, similar conclusions on comparing the latency of retrieval models are obtained.
| Original language | English |
|---|---|
| Title of host publication | Advances in Information Retrieval |
| Subtitle of host publication | 47th European Conference on Information Retrieval, ECIR 2025, Proceedings |
| Editors | Claudia Hauff, Craig Macdonald, Dietmar Jannach, Gabriella Kazai, Franco Maria Nardini, Fabio Pinelli, Fabrizio Silvestri, Nicola Tonellotto |
| Publisher | Springer |
| Pages | 279-294 |
| Number of pages | 16 |
| ISBN (Electronic) | 978-3-031-88711-6 |
| ISBN (Print) | 978-3-031-88710-9 |
| DOIs | |
| Publication status | Published - 4 Apr 2025 |
| Event | 47th European Conference on Information Retrieval, ECIR 2025 - Lucca, Italy Duration: 6 Apr 2025 → 10 Apr 2025 Conference number: 47 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15573 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 47th European Conference on Information Retrieval, ECIR 2025 |
|---|---|
| Abbreviated title | ECIR 2025 |
| Country/Territory | Italy |
| City | Lucca |
| Period | 6/04/25 → 10/04/25 |
Keywords
- 2025 OA procedure
- Latency
- Learned Sparse Retrieval
- Performance Modelling
- Efficiency