TY - GEN
T1 - A comparison of three programming models for adaptive applications on the origin2000
AU - Shan, Hongzhang
AU - Singh, Jaswinder Pal
AU - Oliker, Leonid
AU - Biswas, Rupak
N1 - Publisher Copyright:
© 2000 IEEE.
PY - 2000
Y1 - 2000
N2 - Adaptive applications have computational workloads and communication patterns which change unpredictably at runtime, requiring dynamic load balancing to achieve scalable performance on parallel machines. Efficient parallel implementations of such adaptive applications is therefore a challenging task. In this paper, we compare the performance of and the programming effort required for two major classes of adaptive applications under three leading parallel programming models on an SGI Origin2000 system, a machine which supports all three models efficiently. Results indicate that the three models deliver comparable performance; however, the implementations differ significantly beyond merely using explicit messages versus implicit loads/stores even though the basic parallel algorithms are similar. Compared with the message-passing (using MPI) and SHMEM programming models, the cache-coherent shared address space (CC-SAS) model provides substantial ease of programming at both the conceptual and program orchestration levels, often accompanied by performance gains. However, CC-SAS currently has portability limitations and may suffer from poor spatial locality of physically distributed shared data on large numbers of processors.
AB - Adaptive applications have computational workloads and communication patterns which change unpredictably at runtime, requiring dynamic load balancing to achieve scalable performance on parallel machines. Efficient parallel implementations of such adaptive applications is therefore a challenging task. In this paper, we compare the performance of and the programming effort required for two major classes of adaptive applications under three leading parallel programming models on an SGI Origin2000 system, a machine which supports all three models efficiently. Results indicate that the three models deliver comparable performance; however, the implementations differ significantly beyond merely using explicit messages versus implicit loads/stores even though the basic parallel algorithms are similar. Compared with the message-passing (using MPI) and SHMEM programming models, the cache-coherent shared address space (CC-SAS) model provides substantial ease of programming at both the conceptual and program orchestration levels, often accompanied by performance gains. However, CC-SAS currently has portability limitations and may suffer from poor spatial locality of physically distributed shared data on large numbers of processors.
UR - http://www.scopus.com/inward/record.url?scp=1942448565&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1942448565&partnerID=8YFLogxK
U2 - 10.1109/SC.2000.10004
DO - 10.1109/SC.2000.10004
M3 - Conference contribution
AN - SCOPUS:1942448565
T3 - Proceedings of the International Conference on Supercomputing
BT - SC 2000 - Proceedings of the 2000 ACM/IEEE Conference on Supercomputing
PB - Association for Computing Machinery
T2 - 2000 ACM/IEEE Conference on Supercomputing, SC 2000
Y2 - 4 November 2000 through 10 November 2000
ER -