TY - JOUR
T1 - Analysis of multigrid algorithms on massively parallel computers
T2 - Architectural implications
AU - Matheson, Lesley R.
AU - Tarjan, Robert E.
N1 - Funding Information:
2 Research at Princeton University partially supported by the National Science Foundation, Grant CCR-8920505; the Office of Naval Research, Contract N0014-91-J-1463; and DIMACS (Center for Discrete Mathematics and Theoretical Computer Science), a National Science and Technology Center, Grant NSF-STC88-09648.
PY - 1996/2/25
Y1 - 1996/2/25
N2 - We study the potential performance of multigrid algorithms running on massively parallel computers with the intent of discovering whether currently envisioned machines will provide an efficient platform for such algorithms. These algorithms substantially improve the performance of iterative methods of solving partial differential equations. We consider the domain parallel version of the standard V-cycle multigrid algorithm on model problems, discretized using finite difference techniques in two and three dimensions on block-structured grids of size 106 and 109, respectively. We develop a set of models of parallel computation which reflect the computing characteristics of the current generation of massively parallel multicomputers. These models are based on an interconnection network of 256 to 16,384 message passing, "workstation size" processors executing in a SPMD mode. The models, based on the computing characteristics of an architectural class, provide metrics which balance abstraction with machine specificity. With the medium grain parallelism of the current generation and the high fixed cost of an interprocessor communication, our analysis suggests that an efficient implementation for practical problem sizes requires the machine to support the efficient transmission of long messages (up to 1000 words); otherwise the high initiation cost of a communication must be significantly reduced through an alternative optimization technique. The analysis also suggests that low diameter multistage networks provide little or no advantage over a simple single stage communications network. Finally, the analysis suggests that fine grain parallelism and low fixed communication costs may provide more efficiency than medium grain parallelism with low variable communications costs.
AB - We study the potential performance of multigrid algorithms running on massively parallel computers with the intent of discovering whether currently envisioned machines will provide an efficient platform for such algorithms. These algorithms substantially improve the performance of iterative methods of solving partial differential equations. We consider the domain parallel version of the standard V-cycle multigrid algorithm on model problems, discretized using finite difference techniques in two and three dimensions on block-structured grids of size 106 and 109, respectively. We develop a set of models of parallel computation which reflect the computing characteristics of the current generation of massively parallel multicomputers. These models are based on an interconnection network of 256 to 16,384 message passing, "workstation size" processors executing in a SPMD mode. The models, based on the computing characteristics of an architectural class, provide metrics which balance abstraction with machine specificity. With the medium grain parallelism of the current generation and the high fixed cost of an interprocessor communication, our analysis suggests that an efficient implementation for practical problem sizes requires the machine to support the efficient transmission of long messages (up to 1000 words); otherwise the high initiation cost of a communication must be significantly reduced through an alternative optimization technique. The analysis also suggests that low diameter multistage networks provide little or no advantage over a simple single stage communications network. Finally, the analysis suggests that fine grain parallelism and low fixed communication costs may provide more efficiency than medium grain parallelism with low variable communications costs.
UR - http://www.scopus.com/inward/record.url?scp=0030600717&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0030600717&partnerID=8YFLogxK
U2 - 10.1006/jpdc.1996.0022
DO - 10.1006/jpdc.1996.0022
M3 - Article
AN - SCOPUS:0030600717
SN - 0743-7315
VL - 33
SP - 33
EP - 43
JO - Journal of Parallel and Distributed Computing
JF - Journal of Parallel and Distributed Computing
IS - 1
ER -