Abstract
Random forests and decision trees are increasingly interesting candidates for resource-constrained machine learning models. In order to make the execution of these models efficient under resource limitations, various optimized implementations have been proposed in the literature, usually implementing either native trees or if-else trees. While a certain motivation for the optimization of if-else trees is to benefit the behavior of dedicated instruction caches, in this work we highlight that if-else trees might also strongly depend on data caches. We identify one crucial issue of if-else tree implementations and propose an optimized implementation, which keeps the logic tree structure untouched and thus does not influence the accuracy, but eliminates the need to load comparison values from the data caches. Experimental evaluation of this implementation shows that we can greatly reduce the amount of data cache misses by up to 99%, while not increasing the amount of instruction cache misses in comparison to the state-of-the-art. We additionally highlight various scenarios, where the reduction of data cache misses draws important benefit on the allover execution time.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases |
Subtitle of host publication | European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part V |
Editors | Massih-Reza Amini, Stéphane Canu, Asja Fischer, Tias Guns, Petra Kralj Novak, Grigorios Tsoumakas |
Place of Publication | Cham |
Publisher | Springer |
Pages | 531-546 |
Number of pages | 16 |
ISBN (Electronic) | 978-3-031-26419-1 |
ISBN (Print) | 978-3-031-26418-4 |
DOIs | |
Publication status | Published - 17 Mar 2023 |
Event | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2022 - World Trade Center, Grenoble, France Duration: 19 Sept 2022 → 23 Sept 2022 Conference number: 22 https://2022.ecmlpkdd.org/index.html https://2022.ecmlpkdd.org/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 13717 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2022 |
---|---|
Abbreviated title | ECML-PKDD2022 |
Country/Territory | France |
City | Grenoble |
Period | 19/09/22 → 23/09/22 |
Internet address |
Keywords
- Random Forests
- Floating point
- Encoding
- hardware-software co-design
- 2023 OA procedure