Abstract
Distributed algorithms for graph searching require a high-performance CPU-efficient hash table that supports find-or-put. This operation either inserts data or indicates that it has already been added before. This paper focuses on the design and evaluation of such a hash table, targeting supercomputers. The latency of find-or-put is minimized by using one-sided RDMA operations. These operations are overlapped as much as possible to reduce waiting times for roundtrips. In contrast to existing work, we use linear probing and argue that this requires less roundtrips. The hash table is implemented in UPC. A peak-throughput of 114.9 million op/s is reached on an Infiniband cluster. With a load-factor of 0.9, find-or-put can be performed in 4.5μs on average. The hash table performance remains very high, even under high loads.
Original language | Undefined |
---|---|
Title of host publication | Parallel Processing and Applied Mathematics |
Subtitle of host publication | 11th International Conference, PPAM 2015, Krakow, Poland, September 6-9, 2015. Revised Selected Papers, Part II |
Editors | Roman Wyrzykowski, Ewa Deelman, Jack Dongarra, Konrad Karczewski, Jacek Kitowski, Kazimierz Wiatr |
Place of Publication | London |
Publisher | Springer |
Pages | 15-24 |
Number of pages | 10 |
ISBN (Electronic) | 978-3-319-32152-3 |
ISBN (Print) | 978-3-319-32151-6 |
DOIs | |
Publication status | Published - Sep 2015 |
Event | 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2016), Revised Selected Papers. - Krakow, Poland Duration: 6 Sep 2015 → 9 Sep 2015 Conference number: 11 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer Verlag |
Volume | 9574 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2016), Revised Selected Papers. |
---|---|
Abbreviated title | PPAM 2015 |
Country | Poland |
City | Krakow |
Period | 6/09/15 → 9/09/15 |
Keywords
- high-performance computing
- remote direct memory access
- EWI-26785
- IR-99479
- par- titioned global address space
- METIS-316032
- Distributed hash table