TY - GEN
T1 - A Maximally Row-Parallel MRAM In-Memory-Computing Macro Addressing Readout Circuit Sensitivity and Area
AU - Deaville, Peter
AU - Zhang, Bonan
AU - Chen, Lung Yen
AU - Verma, Naveen
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/9/13
Y1 - 2021/9/13
N2 - This paper presents the first MRAM-based In-Memory-Computing (IMC) macro, implemented as a 128-kb array in an advanced-node 22nm FD-SOI technology. The design maximizes IMC row parallelism for energy efficiency and throughput, while addressing the critical challenges this raises, namely: high column currents; high output dynamic-range requirements; and large area of peripheral readout circuits. These are addressed through current-insensitive column-multiplexing and high-sensitivity readout circuits, occupying 26% of the macro area. Residual IMC non-idealities, arising from statistical circuit variations, are modeled and incorporated in a chip-generalized one-time neural-network training algorithm, with CIFAR-10 image-classification accuracy demonstrated at 90.1%, equal to ideal digital computation. The design addresses the particularly high sensitivity required for MRAM-based IMC compared to other non-volatile memory technologies, while achieving area-normalized throughput of 758 GOPS/mm2 and energy efficiency of 5.1 TOPS/W for the macro.
AB - This paper presents the first MRAM-based In-Memory-Computing (IMC) macro, implemented as a 128-kb array in an advanced-node 22nm FD-SOI technology. The design maximizes IMC row parallelism for energy efficiency and throughput, while addressing the critical challenges this raises, namely: high column currents; high output dynamic-range requirements; and large area of peripheral readout circuits. These are addressed through current-insensitive column-multiplexing and high-sensitivity readout circuits, occupying 26% of the macro area. Residual IMC non-idealities, arising from statistical circuit variations, are modeled and incorporated in a chip-generalized one-time neural-network training algorithm, with CIFAR-10 image-classification accuracy demonstrated at 90.1%, equal to ideal digital computation. The design addresses the particularly high sensitivity required for MRAM-based IMC compared to other non-volatile memory technologies, while achieving area-normalized throughput of 758 GOPS/mm2 and energy efficiency of 5.1 TOPS/W for the macro.
UR - http://www.scopus.com/inward/record.url?scp=85118446180&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118446180&partnerID=8YFLogxK
U2 - 10.1109/ESSCIRC53450.2021.9567807
DO - 10.1109/ESSCIRC53450.2021.9567807
M3 - Conference contribution
AN - SCOPUS:85118446180
T3 - ESSCIRC 2021 - IEEE 47th European Solid State Circuits Conference, Proceedings
SP - 75
EP - 78
BT - ESSCIRC 2021 - IEEE 47th European Solid State Circuits Conference, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 47th IEEE European Solid State Circuits Conference, ESSCIRC 2021
Y2 - 6 September 2021 through 9 September 2021
ER -