TY - GEN
T1 - Enabling portable energy efficiency with memory accelerated library
AU - Guo, Qi
AU - Low, Tze Meng
AU - Alachiotis, Nikolaos
AU - Akin, Berkin
AU - Pileggi, Larry
AU - Hoe, James C.
AU - Franchetti, Franz
N1 - Publisher Copyright:
© 2015 ACM.
PY - 2015/12/5
Y1 - 2015/12/5
N2 - Over the last decade, the looming power wall has spurred a flurry of interest in developing heterogeneous systems with hardware accelerators. The questions, then, are what and how accelerators should be designed, and what software support is required. Our accelerator design approach stems from the observation that many efficient and portable software implementations rely on high performance software libraries with well-established application programming interfaces (APIs). We propose the integration of hardware accelerators on 3D-stacked memory that explicitly targets the memory-bounded operations within high performance libraries. The fixed APIs with limited configurability simplify the design of the accelerators, while ensuring that the accelerators have wide applicability. With our software support that automatically converts library APIs to accelerator invocations, an additional advantage of our approach is that library-based legacy code automatically gains the benefit of memory-side accelerators without requiring a reimplementation. On average, the legacy code using our proposed MEmory Accelerated Library (MEALib) improves performance and energy efficiency for individual operations in Intel's Math Kernel Library (MKL) by 38x and 75x, respectively. For a real-world signal processing application that employs Intel MKL, MEALib attains more than 10x better energy efficiency.
AB - Over the last decade, the looming power wall has spurred a flurry of interest in developing heterogeneous systems with hardware accelerators. The questions, then, are what and how accelerators should be designed, and what software support is required. Our accelerator design approach stems from the observation that many efficient and portable software implementations rely on high performance software libraries with well-established application programming interfaces (APIs). We propose the integration of hardware accelerators on 3D-stacked memory that explicitly targets the memory-bounded operations within high performance libraries. The fixed APIs with limited configurability simplify the design of the accelerators, while ensuring that the accelerators have wide applicability. With our software support that automatically converts library APIs to accelerator invocations, an additional advantage of our approach is that library-based legacy code automatically gains the benefit of memory-side accelerators without requiring a reimplementation. On average, the legacy code using our proposed MEmory Accelerated Library (MEALib) improves performance and energy efficiency for individual operations in Intel's Math Kernel Library (MKL) by 38x and 75x, respectively. For a real-world signal processing application that employs Intel MKL, MEALib attains more than 10x better energy efficiency.
KW - 3D DRAM
KW - Accelerator
KW - Energy efficiency
KW - Library
UR - http://www.scopus.com/inward/record.url?scp=84959923571&partnerID=8YFLogxK
U2 - 10.1145/2830772.2830788
DO - 10.1145/2830772.2830788
M3 - Conference contribution
AN - SCOPUS:84959923571
T3 - Proceedings of the Annual International Symposium on Microarchitecture, MICRO
SP - 750
EP - 761
BT - MICRO-48
PB - ACM Publishing
T2 - 48th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2015
Y2 - 5 December 2015 through 9 December 2015
ER -