TY - GEN
T1 - The effects of communication parameters on end performance of shared virtual memory clusters
AU - Bilas, Angelos
AU - Singh, Jaswinder Pal
PY - 1997
Y1 - 1997
N2 - Recently there has been a lot of effort in providing cost-effective Shared Memory systems by employing software only solutions on clusters of high-end workstations coupled with high-bandwidth, low-latency commodity networks. Much of the work so far has focused on improving protocols, and there has been some work on restructuring applications to perform better on SVM systems. The result of this progress has been the promise for good performance on a range of applications at least in the 16-32 processor range. New system area networks and network interfaces provide significantly lower overhead, lower latency and higher bandwidth communication in clusters, inexpensive SMPs have become common as the nodes of these clusters, and SVM protocols are now quite mature. With this progress, it is now useful to examine what are the important system bottlenecks that stand in the way of effective parallel performance; in particular, which parameters of the communication architecture are most important to improve further relative to processor speed, which ones are already adequate on modern systems for most applications, and how will this change with technology in the future. Such information can assist system designers in determining where to focus their energies in improving performance, and users in determining what system characteristics are appropriate for their applications. bandwidth relative to processor speed helps some bandwidth-bound applications, but currently available ratios of bandwidth to processor speed are already adequate for many others. Surprisingly, neither the processor overhead for handling messages nor the occupancy of the communication interface in preparing and pushing packets through the network appear to require much improvement.
AB - Recently there has been a lot of effort in providing cost-effective Shared Memory systems by employing software only solutions on clusters of high-end workstations coupled with high-bandwidth, low-latency commodity networks. Much of the work so far has focused on improving protocols, and there has been some work on restructuring applications to perform better on SVM systems. The result of this progress has been the promise for good performance on a range of applications at least in the 16-32 processor range. New system area networks and network interfaces provide significantly lower overhead, lower latency and higher bandwidth communication in clusters, inexpensive SMPs have become common as the nodes of these clusters, and SVM protocols are now quite mature. With this progress, it is now useful to examine what are the important system bottlenecks that stand in the way of effective parallel performance; in particular, which parameters of the communication architecture are most important to improve further relative to processor speed, which ones are already adequate on modern systems for most applications, and how will this change with technology in the future. Such information can assist system designers in determining where to focus their energies in improving performance, and users in determining what system characteristics are appropriate for their applications. bandwidth relative to processor speed helps some bandwidth-bound applications, but currently available ratios of bandwidth to processor speed are already adequate for many others. Surprisingly, neither the processor overhead for handling messages nor the occupancy of the communication interface in preparing and pushing packets through the network appear to require much improvement.
KW - Bandwidth
KW - Clustering
KW - Communication parameters
KW - Distributed memory
KW - Host overhead
KW - Interrupt cost
KW - Latency
KW - Network occupancy
KW - Shared memory
UR - http://www.scopus.com/inward/record.url?scp=84900303378&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84900303378&partnerID=8YFLogxK
U2 - 10.1145/509593.509594
DO - 10.1145/509593.509594
M3 - Conference contribution
AN - SCOPUS:84900303378
SN - 0897919858
SN - 9780897919852
T3 - Proceedings of the International Conference on Supercomputing
BT - Proceedings of the 1997 ACM/IEEE Conference on Supercomputing, SC 1997
PB - Association for Computing Machinery
T2 - 1997 ACM/IEEE Conference on Supercomputing, SC 1997
Y2 - 15 November 1997 through 21 November 1997
ER -