Persistent data staging services for data intensive in-situ scientific workflows

Melissa Romanus, Fan Zhang, Tong Jin, Qian Sun, Hoang Bui, Manish Parashar, Jong Choi, Saloman Janhunen, Robert Hager, Scott Klasky, Choong Seock Chang, Ivan Rodero

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Scopus citations

Abstract

Scientific simulation workflows executing on very large scale computing systems are essential modalities for scientific investigation. The increasing scales and resolution of these simulations provide new opportunities for accurately modeling complex natural and engineered phenomena. However, the increasing complexity necessitates managing, transporting, and processing unprecedented amounts of data, and as a result, researchers are increasingly exploring data-staging and in-situ workflows to reduce data movement and data-related overheads. However, as these workflows become more dynamic in their structures and behaviors, data staging and in-situ solutions must evolve to support new requirements. In this paper, we explore how the service-oriented concept can be applied to extreme-scale in-situ workflows. Specifically, we explore persistent data staging as a service and present the design and implementation of DataSpaces as a Service, a service-oriented data staging framework. We use a dynamically coupled fusion simulation workflow to illustrate the capabilities of this framework and evaluate its performance and scalability.

Original languageEnglish (US)
Title of host publicationDIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing
PublisherAssociation for Computing Machinery, Inc
Pages37-44
Number of pages8
ISBN (Electronic)9781450343527
DOIs
StatePublished - Jun 1 2016
Event6th ACM International Workshop on Data-Intensive Distributed Computing, DIDC 2016 - Kyoto, Japan
Duration: Jun 1 2016 → …

Publication series

NameDIDC 2016 - Proceedings of the ACM International Workshop on Data-Intensive Distributed Computing

Conference

Conference6th ACM International Workshop on Data-Intensive Distributed Computing, DIDC 2016
Country/TerritoryJapan
CityKyoto
Period6/1/16 → …

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Persistent data staging services for data intensive in-situ scientific workflows'. Together they form a unique fingerprint.

Cite this