TY - GEN
T1 - All-pairs
T2 - IPDPS 2008 - 22nd IEEE International Parallel and Distributed Processing Symposium
AU - Moretti, Christopher
AU - Bulosan, Jared
AU - Thain, Douglas
AU - Flynn, Patrick J.
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - Although modern parallel and distributed computing systems provide easy access to large amounts of computing power, it is not always easy for non-expert users to harness these large systems effectively. A large workload composed in what seems to be the obvious way by a naive user may accidentally abuse shared resources and achieve very poor performance. To address this problem, we propose that production systems should provide end users with high-level abstractions that allow for the easy expression and efficient execution of data intensive workloads. We present one example of an abstraction - All-Pairs - that fits the needs of several data-intensive scientific applications. We demonstrate that an optimized All-Pairs abstraction is both easier to use than the underlying system, and achieves performance orders of magnitude better than the obvious but naive approach, and twice as fast as a hand-optimized conventional approach.
AB - Although modern parallel and distributed computing systems provide easy access to large amounts of computing power, it is not always easy for non-expert users to harness these large systems effectively. A large workload composed in what seems to be the obvious way by a naive user may accidentally abuse shared resources and achieve very poor performance. To address this problem, we propose that production systems should provide end users with high-level abstractions that allow for the easy expression and efficient execution of data intensive workloads. We present one example of an abstraction - All-Pairs - that fits the needs of several data-intensive scientific applications. We demonstrate that an optimized All-Pairs abstraction is both easier to use than the underlying system, and achieves performance orders of magnitude better than the obvious but naive approach, and twice as fast as a hand-optimized conventional approach.
UR - http://www.scopus.com/inward/record.url?scp=51049112055&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51049112055&partnerID=8YFLogxK
U2 - 10.1109/IPDPS.2008.4536311
DO - 10.1109/IPDPS.2008.4536311
M3 - Conference contribution
AN - SCOPUS:51049112055
SN - 9781424416943
T3 - IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM
BT - IPDPS Miami 2008 - Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium, Program and CD-ROM
Y2 - 14 April 2008 through 18 April 2008
ER -