Design of multiprocessor systems for concurrent error detection and fault diagnosis

Bapiraju Vinnakota, Niraj K. Jha

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Abstract

Results on the design of systems using algorithm-based fault tolerance (ABFT), a low-overhead fault tolerance scheme for high-speed parallel processing systems, are presented. Bounds on the diagnosability of the system and the number of checks needed to design a unit system of given capability are derived. A procedure for forming the target fault-tolerant system from the unit system is introduced. The procedure is applicable to a wide range of systems in which processors may share data elements. The applications of the design scheme are illustrated through examples.

Original language: English (US)
Title of host publication: 91 Fault-Tolerant Comput. Symp.
Publisher: IEEE
Pages: 504-511
Number of pages: 8
ISBN (Print): 0818621508
State: Published - Jun 1 1991
Event: 21st International Symposium on Fault-Tolerant Computing - Montreal, Que, Can
Duration: Jun 25 1991 to Jun 27 1991

Publication series

Name: Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)
ISSN (Print): 0731-3071

Other

Other: 21st International Symposium on Fault-Tolerant Computing
City: Montreal, Que, Can
Period: 6/25/91 to 6/27/91

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
