TY - GEN
T1 - Specification and synthesis of hardware checkpointing and rollback mechanisms
AU - Chan, Carven
AU - Schwartz-Narbonne, Daniel
AU - Sethi, Divjyot
AU - Malik, Sharad
PY - 2012/7/11
Y1 - 2012/7/11
N2 - The increasing pressure to make hardware resilient to runtime failures has prompted development of design techniques for specific classes of systems, e.g. processors and routers. However, these techniques come at increased design and verification costs, thus limiting their broader application. In this work we describe a methodology for general RTL designs based on the widely usable checkpointing and rollback resiliency mechanism. We take a modeling and language approach that provides an appropriate set of abstractions for the resiliency logic. This cleanly separates the main design behavior from the resiliency behavior, leading to ease of design. Further, as the language abstractions can be automatically synthesized into resiliency logic, our methodology can merge with existing design flows. The concerns of verifying this additional resiliency logic can be addressed by synthesizing behavioral assertions capturing correct behavior. We demonstrate the use of this methodology on four examples, with synthesis for performance and area to estimate the overhead of the additional synthesis logic.
AB - The increasing pressure to make hardware resilient to runtime failures has prompted development of design techniques for specific classes of systems, e.g. processors and routers. However, these techniques come at increased design and verification costs, thus limiting their broader application. In this work we describe a methodology for general RTL designs based on the widely usable checkpointing and rollback resiliency mechanism. We take a modeling and language approach that provides an appropriate set of abstractions for the resiliency logic. This cleanly separates the main design behavior from the resiliency behavior, leading to ease of design. Further, as the language abstractions can be automatically synthesized into resiliency logic, our methodology can merge with existing design flows. The concerns of verifying this additional resiliency logic can be addressed by synthesizing behavioral assertions capturing correct behavior. We demonstrate the use of this methodology on four examples, with synthesis for performance and area to estimate the overhead of the additional synthesis logic.
KW - CpR-verilog
KW - backward error recovery
UR - http://www.scopus.com/inward/record.url?scp=84863543774&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863543774&partnerID=8YFLogxK
U2 - 10.1145/2228360.2228585
DO - 10.1145/2228360.2228585
M3 - Conference contribution
AN - SCOPUS:84863543774
SN - 9781450311991
T3 - Proceedings - Design Automation Conference
SP - 1226
EP - 1232
BT - Proceedings of the 49th Annual Design Automation Conference, DAC '12
T2 - 49th Annual Design Automation Conference, DAC '12
Y2 - 3 June 2012 through 7 June 2012
ER -