TY - GEN
T1 - A Switched-Capacitor SRAM In-memory Computing Macro with High-precision, High-efficiency Differential Architecture
AU - Lee, Jinseok
AU - Zhang, Bonan
AU - Verma, Naveen
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper presents an 1152 × 256 switched-capacitor (SC) SRAM in-memory computing (IMC) macro in 28 nm CMOS. SC IMC has enabled high-SNR analog computation, wherein ADC quantization now poses the critical precision and energy limitation. This limitation is addressed in this work by: (1) a fully differential architecture, which enables doubling of the supply-limited signal swing, allowing the ADC dynamic range to be efficiently increased to 10 b; (2) ADC sharing across either two or four binary-weighted columns to amortize energy and area; (3) configurable non-uniform quantization, optimized for the high concentration of data from IMC computation typically around the ADC mid-range. The macro achieves state-of-art (SoA) energy efficiency of 8161 TOPS/W and compute-density of 111.8 TOPS/mm2, both normalized to 1-b computations. In addition to high IMC compute precision, neural-net (NN) classification is demonstrated by mapping ResNet-18 to the prototype, performing both CIFAR-10 and the more complex ImageNet tasks, both achieving ideal-level accuracies of 92.34% and 69.88%, respectively.
AB - This paper presents an 1152 × 256 switched-capacitor (SC) SRAM in-memory computing (IMC) macro in 28 nm CMOS. SC IMC has enabled high-SNR analog computation, wherein ADC quantization now poses the critical precision and energy limitation. This limitation is addressed in this work by: (1) a fully differential architecture, which enables doubling of the supply-limited signal swing, allowing the ADC dynamic range to be efficiently increased to 10 b; (2) ADC sharing across either two or four binary-weighted columns to amortize energy and area; (3) configurable non-uniform quantization, optimized for the high concentration of data from IMC computation typically around the ADC mid-range. The macro achieves state-of-art (SoA) energy efficiency of 8161 TOPS/W and compute-density of 111.8 TOPS/mm2, both normalized to 1-b computations. In addition to high IMC compute precision, neural-net (NN) classification is demonstrated by mapping ResNet-18 to the prototype, performing both CIFAR-10 and the more complex ImageNet tasks, both achieving ideal-level accuracies of 92.34% and 69.88%, respectively.
UR - https://www.scopus.com/pages/publications/85208437403
UR - https://www.scopus.com/inward/citedby.url?scp=85208437403&partnerID=8YFLogxK
U2 - 10.1109/ESSERC62670.2024.10719551
DO - 10.1109/ESSERC62670.2024.10719551
M3 - Conference contribution
AN - SCOPUS:85208437403
T3 - European Solid-State Circuits Conference
SP - 357
EP - 360
BT - ESSERC 2024 - Proceedings
PB - IEEE Computer Society
T2 - 50th IEEE European Solid-State Electronics Research Conference, ESSERC 2024
Y2 - 9 September 2024 through 12 September 2024
ER -