TY - JOUR
T1 - Comparison of MPI, SHMEM and cache-coherent shared address space programming models on the SGI Origin2000
AU - Shan, Hongzhang
AU - Singh, Jaswinder Pal
PY - 1999/1/1
Y1 - 1999/1/1
N2 - We compare the performance of three major programming models - a load-store cache-coherent shared address space (CC-SAS), message passing (MP) and the segmented SHMEM model - on a modern, 64-processor hardware cache-coherent machine, one of the two major types of platforms upon which high-performance computing is converging. We focus on applications that are either regular and predictable or at least do not require fine-grained dynamic replication of irregularly accessed data. Within this class, we use programs with a range of important communication patterns. We examine whether the basic parallel algorithm and communication structuring approaches needed for best performance are similar or different among the models, whether some models have substantial performance advantages over others as problem size and number of processors change, what the sources of these performance differences are, where the programs spend their time, and whether substantial improvements can be obtained by modifying either the application programming interfaces or the implementations of the programming models on this type of platform.
AB - We compare the performance of three major programming models - a load-store cache-coherent shared address space (CC-SAS), message passing (MP) and the segmented SHMEM model - on a modern, 64-processor hardware cache-coherent machine, one of the two major types of platforms upon which high-performance computing is converging. We focus on applications that are either regular and predictable or at least do not require fine-grained dynamic replication of irregularly accessed data. Within this class, we use programs with a range of important communication patterns. We examine whether the basic parallel algorithm and communication structuring approaches needed for best performance are similar or different among the models, whether some models have substantial performance advantages over others as problem size and number of processors change, what the sources of these performance differences are, where the programs spend their time, and whether substantial improvements can be obtained by modifying either the application programming interfaces or the implementations of the programming models on this type of platform.
UR - http://www.scopus.com/inward/record.url?scp=0032691538&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0032691538&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:0032691538
SP - 329
EP - 338
JO - Proceedings of the International Conference on Supercomputing
JF - Proceedings of the International Conference on Supercomputing
ER -