CORP, a procedure for recovery from transient faults in real-time sensitive medical applications, is introduced. CORP is more effective than the traditional approach because it relies on concurrent retries executed by two neighboring processors in the array (the faulty processor and an assistant), rather than on successive retries executed only by the faulty processor. Techniques for analyzing how the occurrence of transient or intermittent faults disturbs the execution of a parallel algorithm in linear arrays are discussed. An optimal assistant assignment policy that maximizes array performance in the presence of faults is constructed. An adaptive implementation of the optimal policy in linear-wavefront arrays, using local distributed control and near-neighbor communication, is presented.
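The contrast between successive and concurrent retries can be illustrated with a toy calculation. The sketch below is not the paper's analysis: it assumes a simple geometric model in which each retry round succeeds independently, with hypothetical probabilities `p_faulty` (the faulty processor's retry succeeds) and `p_assistant` (the assistant's retry succeeds). The function names are likewise illustrative.

```python
def expected_rounds_successive(p_faulty: float) -> float:
    """Expected retry rounds when only the faulty processor retries.

    Assumed model: each retry succeeds independently with probability
    p_faulty, so the round count is geometric with mean 1 / p_faulty.
    """
    if not 0.0 < p_faulty <= 1.0:
        raise ValueError("p_faulty must be in (0, 1]")
    return 1.0 / p_faulty


def expected_rounds_concurrent(p_faulty: float, p_assistant: float) -> float:
    """Expected retry rounds when faulty and assistant retry in parallel.

    Assumed model: a round succeeds if at least one of the two
    independent retries succeeds.
    """
    if not 0.0 < p_faulty <= 1.0:
        raise ValueError("p_faulty must be in (0, 1]")
    if not 0.0 <= p_assistant <= 1.0:
        raise ValueError("p_assistant must be in [0, 1]")
    p_round = 1.0 - (1.0 - p_faulty) * (1.0 - p_assistant)
    return 1.0 / p_round


if __name__ == "__main__":
    # Hypothetical values chosen only for illustration.
    print(expected_rounds_successive(0.5))
    print(expected_rounds_concurrent(0.5, 0.5))
```

Under these assumptions the concurrent scheme never needs more rounds on average than the successive one. The model leaves out the cost of borrowing a neighbor as assistant, which is the kind of trade-off an assistant assignment policy would have to weigh.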
|Original language||English (US)|
|Title of host publication||Unknown Host Publication Title|
|Number of pages||5|
|State||Published - Jan 1 1988|