Immediate Split Trees: Immediate Encoding of Floating Point Split Values in Random Forests

Christian Hakert, Kuan-Hsun Chen, Jian-Jia Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

43 Downloads (Pure)

Abstract

Random forests and decision trees are increasingly interesting candidates for resource-constrained machine learning models. In order to make the execution of these models efficient under resource limitations, various optimized implementations have been proposed in the literature, usually implementing either native trees or if-else trees. While a certain motivation for the optimization of if-else trees is to benefit the behavior of dedicated instruction caches, in this work we highlight that if-else trees might also strongly depend on data caches. We identify one crucial issue of if-else tree implementations and propose an optimized implementation, which keeps the logic tree structure untouched and thus does not influence the accuracy, but eliminates the need to load comparison values from the data caches. Experimental evaluation of this implementation shows that we can greatly reduce the amount of data cache misses by up to 99%, while not increasing the amount of instruction cache misses in comparison to the state-of-the-art. We additionally highlight various scenarios, where the reduction of data cache misses draws important benefit on the allover execution time.
Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases
Subtitle of host publicationEuropean Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part V
EditorsMassih-Reza Amini, Stéphane Canu, Asja Fischer, Tias Guns, Petra Kralj Novak, Grigorios Tsoumakas
Place of PublicationCham
PublisherSpringer
Pages531-546
Number of pages16
ISBN (Electronic)978-3-031-26419-1
ISBN (Print)978-3-031-26418-4
DOIs
Publication statusPublished - 17 Mar 2023
EventEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2022 - World Trade Center, Grenoble, France
Duration: 19 Sept 202223 Sept 2022
Conference number: 22
https://2022.ecmlpkdd.org/index.html
https://2022.ecmlpkdd.org/

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume13717
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2022
Abbreviated titleECML-PKDD2022
Country/TerritoryFrance
CityGrenoble
Period19/09/2223/09/22
Internet address

Keywords

  • Random Forests
  • Floating point
  • Encoding
  • hardware-software co-design
  • 2023 OA procedure

Fingerprint

Dive into the research topics of 'Immediate Split Trees: Immediate Encoding of Floating Point Split Values in Random Forests'. Together they form a unique fingerprint.

Cite this