TY - GEN
T1 - Real-time, concurrent checkpoint for parallel programs
AU - Li, Kai
AU - Naughton, Jeffrey F.
AU - Plank, James S.
N1 - Funding Information:
This research was supported in part by the National Science Foundation under grant CCR-8814265 and by the Digital Equipment External Research Program and Systems Research Center. Kai Li and James Plank’s address: Computer Science Department, Princeton University. Jeffrey Naughton’s address: Computer Science Department, University of Wisconsin.
Publisher Copyright:
© 1990 ACM.
PY - 1990/2/1
Y1 - 1990/2/1
N2 - We have developed and implemented a checkpointing and restart algorithm for parallel programs running on commercial uniprocessors and shared-memory multiprocessors. The algorithm runs concurrently with the target program, interrupts the target program for small, fixed amounts of time and is transparent to the checkpointed program and its compiler. The algorithm achieves its efficiency through a novel use of address translation hardware that allows the most time-consuming operations of the checkpoint to be overlapped with the running of the program being checkpointed.
AB - We have developed and implemented a checkpointing and restart algorithm for parallel programs running on commercial uniprocessors and shared-memory multiprocessors. The algorithm runs concurrently with the target program, interrupts the target program for small, fixed amounts of time and is transparent to the checkpointed program and its compiler. The algorithm achieves its efficiency through a novel use of address translation hardware that allows the most time-consuming operations of the checkpoint to be overlapped with the running of the program being checkpointed.
UR - http://www.scopus.com/inward/record.url?scp=84990735760&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84990735760&partnerID=8YFLogxK
U2 - 10.1145/99163.99173
DO - 10.1145/99163.99173
M3 - Conference contribution
AN - SCOPUS:84990735760
T3 - Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
SP - 79
EP - 88
BT - Proceedings of the 2nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 1990
PB - Association for Computing Machinery
T2 - 2nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 1990
Y2 - 14 March 1990 through 16 March 1990
ER -