TY - JOUR
T1 - Exploiting Approximate Feature Extraction via Genetic Programming for Hardware Acceleration in a Heterogeneous Microprocessor
AU - Jia, Hongyang
AU - Verma, Naveen
N1 - Funding Information:
Manuscript received August 3, 2017; revised October 19, 2017 and November 30, 2017; accepted December 15, 2017. Date of publication January 17, 2018; date of current version March 23, 2018. This paper was approved by Guest Editor Makoto Ikeda. This work was supported in part by AFOSR, in part by NSF under Grant CCF-1253670, and in part by C-FAR/SONIC, two SRC STARnet centers by MARCO and DARPA. (Corresponding author: Hongyang Jia.) The authors are with the Department of Electrical Engineering, Princeton University, Princeton, NJ 08544 USA (e-mail: hjia@princeton.edu; nverma@princeton.edu).
Publisher Copyright:
© 2012 IEEE.
PY - 2018/4
Y1 - 2018/4
N2 - This paper presents a heterogeneous microprocessor for low-energy sensor-inference applications. Hardware acceleration has shown to enable substantial energy-efficiency and throughput gains, but raises significant challenges where programmable computations are required, as in the case of feature extraction. To overcome this, a programmable feature-extraction accelerator (FEA) is presented that exploits genetic programming for automatic program synthesis. This leads to approximate, but highly structured, computations, enabling: 1) a high degree of specialization; 2) systematic mapping of programs to the accelerator; and 3) energy scalability via user-controllable approximation knobs. A microprocessor integrating a CPU with feature-extraction and classification accelerators is prototyped in 130-nm CMOS. Two medical-sensor applications (electroencephalogram-based seizure detection and electrocardiogram-based arrhythmia detection) demonstrate 325 × and 156 × energy reduction, respectively, for programmable feature extraction implemented on the accelerator versus a CPU-only architecture, and 7.6 × and 6.5 × energy reduction, respectively, versus a CPU-with-coprocessor architecture. Furthermore, 20 × and 9 × energy scalability, respectively, is demonstrated via the approximation knobs. The energy-efficiency of the programmable FEA is 220 GOPS/W, near that of fixed-function accelerators in the same technology, exceeding typical programmable accelerators.
AB - This paper presents a heterogeneous microprocessor for low-energy sensor-inference applications. Hardware acceleration has shown to enable substantial energy-efficiency and throughput gains, but raises significant challenges where programmable computations are required, as in the case of feature extraction. To overcome this, a programmable feature-extraction accelerator (FEA) is presented that exploits genetic programming for automatic program synthesis. This leads to approximate, but highly structured, computations, enabling: 1) a high degree of specialization; 2) systematic mapping of programs to the accelerator; and 3) energy scalability via user-controllable approximation knobs. A microprocessor integrating a CPU with feature-extraction and classification accelerators is prototyped in 130-nm CMOS. Two medical-sensor applications (electroencephalogram-based seizure detection and electrocardiogram-based arrhythmia detection) demonstrate 325 × and 156 × energy reduction, respectively, for programmable feature extraction implemented on the accelerator versus a CPU-only architecture, and 7.6 × and 6.5 × energy reduction, respectively, versus a CPU-with-coprocessor architecture. Furthermore, 20 × and 9 × energy scalability, respectively, is demonstrated via the approximation knobs. The energy-efficiency of the programmable FEA is 220 GOPS/W, near that of fixed-function accelerators in the same technology, exceeding typical programmable accelerators.
KW - Approximate computation
KW - feature extraction
KW - machine learning
KW - programmable accelerator
KW - sensor inference
UR - http://www.scopus.com/inward/record.url?scp=85040927514&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85040927514&partnerID=8YFLogxK
U2 - 10.1109/JSSC.2017.2787762
DO - 10.1109/JSSC.2017.2787762
M3 - Article
AN - SCOPUS:85040927514
SN - 0018-9200
VL - 53
SP - 1016
EP - 1027
JO - IEEE Journal of Solid-State Circuits
JF - IEEE Journal of Solid-State Circuits
IS - 4
ER -