In-Staging Data Placement for Asynchronous Coupling of Task-Based Scientific Workflows

Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the asynchronous interaction and coupling behaviors between tasks of the component applications. In this paper, we present an adaptive data placement a...

Full description

Saved in:
Bibliographic Details
Published in2016 Second International Workshop on Extreme Scale Programming Models and Middlewar (ESPM2) pp. 2 - 9
Main Authors Qian Sun, Romanus, Melissa, Tong Jin, Hongfeng Yu, Bremer, Peer-Timo, Petruzza, Steve, Klasky, Scott, Parashar, Manish
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2016
Subjects
Online AccessGet full text
DOI10.1109/ESPM2.2016.006

Cover

Loading…
Abstract Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the asynchronous interaction and coupling behaviors between tasks of the component applications. In this paper, we present an adaptive data placement approach that addresses these challenges by dynamically adjusting to the asynchronous coupling patterns. Specifically, it places data across a set of staging cores/nodes with an awareness of the application-specific data locality requirements and the runtime task executions at these staging cores/nodes, with the goal of reducing end-to-end execution time and data movement overhead of the workflow. We experimentally demonstrate the effectiveness of our approach on the Titan Cray XK7 system using representative data coupling patterns derived from current scientific workflows. The evaluation demonstrates that our approach efficiently improves performance by reducing the time-to-solution and increasing the quality of insights for scientific discovery.
AbstractList Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the asynchronous interaction and coupling behaviors between tasks of the component applications. In this paper, we present an adaptive data placement approach that addresses these challenges by dynamically adjusting to the asynchronous coupling patterns. Specifically, it places data across a set of staging cores/nodes with an awareness of the application-specific data locality requirements and the runtime task executions at these staging cores/nodes, with the goal of reducing end-to-end execution time and data movement overhead of the workflow. We experimentally demonstrate the effectiveness of our approach on the Titan Cray XK7 system using representative data coupling patterns derived from current scientific workflows. The evaluation demonstrates that our approach efficiently improves performance by reducing the time-to-solution and increasing the quality of insights for scientific discovery.
Author Qian Sun
Parashar, Manish
Romanus, Melissa
Tong Jin
Petruzza, Steve
Hongfeng Yu
Bremer, Peer-Timo
Klasky, Scott
Author_xml – sequence: 1
  surname: Qian Sun
  fullname: Qian Sun
  organization: Rutgers Univ., Piscataway, NJ, USA
– sequence: 2
  givenname: Melissa
  surname: Romanus
  fullname: Romanus, Melissa
  organization: Rutgers Univ., Piscataway, NJ, USA
– sequence: 3
  surname: Tong Jin
  fullname: Tong Jin
  organization: Rutgers Univ., Piscataway, NJ, USA
– sequence: 4
  surname: Hongfeng Yu
  fullname: Hongfeng Yu
  organization: Oak Ridge Nat. Labortory, Oak Ridge, TN, USA
– sequence: 5
  givenname: Peer-Timo
  surname: Bremer
  fullname: Bremer, Peer-Timo
  organization: Univ. of Nebraska-Lincoln, Lincoln, NE, USA
– sequence: 6
  givenname: Steve
  surname: Petruzza
  fullname: Petruzza, Steve
  organization: Univ. of Utah, Salt Lake City, UT, USA
– sequence: 7
  givenname: Scott
  surname: Klasky
  fullname: Klasky, Scott
  organization: Lawrence Livermore Nat. Lab., Livermore, CA, USA
– sequence: 8
  givenname: Manish
  surname: Parashar
  fullname: Parashar, Manish
  organization: Rutgers Univ., Piscataway, NJ, USA
BookMark eNotzE1PwjAcgPGa6EGRqxcv_QKb_660a484EUkwkmzRI-krNoyWrCOGb69GT8_ll-cGXcYUHUJ3BEpCQD4s2s1rVVZAeAnAL9BU1oIwkEAFE_Iava9i0Y5qF-IOP6lR4U2vjDu4OGKfBjzP52g-hxTTKeMmnY79L0wedyrvi0eVncWtCT88-GDwRxr2vk9f-RZdedVnN_3vBHXPi655KdZvy1UzXxdBwlhIop2t7cxzL6T2UFFXeU2ZZsClJpIxo63y1Kq6UtRz0EAYN8ZyLoijhE7Q_d82OOe2xyEc1HDe1oISxmb0G3MvTiY
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ESPM2.2016.006
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781509038589
1509038582
EndPage 9
ExternalDocumentID 7831554
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i90t-91bed7d4f6f89bf023e2fb35b5069b1955cbdaf3da72a3f60b0156ccd6681e313
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:48 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-91bed7d4f6f89bf023e2fb35b5069b1955cbdaf3da72a3f60b0156ccd6681e313
PageCount 8
ParticipantIDs ieee_primary_7831554
PublicationCentury 2000
PublicationDate 2016-Nov.
PublicationDateYYYYMMDD 2016-11-01
PublicationDate_xml – month: 11
  year: 2016
  text: 2016-Nov.
PublicationDecade 2010
PublicationTitle 2016 Second International Workshop on Extreme Scale Programming Models and Middlewar (ESPM2)
PublicationTitleAbbrev ESPM2
PublicationYear 2016
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.6397083
Snippet Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the...
SourceID ieee
SourceType Publisher
StartPage 2
SubjectTerms Adaptation models
Couplings
Data models
Data processing
Data storage systems
Distributed databases
Monitoring
Runtime
Title In-Staging Data Placement for Asynchronous Coupling of Task-Based Scientific Workflows
URI https://ieeexplore.ieee.org/document/7831554
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nZgAtYhveWDEbdLETjJCaVWQiipRULfKnwgVJYgmQvDr8TmlCMTAZmVwIvucd3e-9w7gLGJaa64Y1ZJZGuuAUeksiaLWXOwsShjlq3xv-fg-vpmzeQPON1wYY4wvPjNdHPq7fF2oClNlvSSNEP6a0HSBW83VWuswhkHWG95NJ0itCvF6gf_oluLBYrQNk6_X1DUiy25Vyq76-KXA-N_v2IHONy2PTDeAswsNk7fh4TqnzmPEXkPkSpSCTDExjhMQ54-Si9V7rlAA10X4ZFBUSMB9JIUlM7Fa0kuHYZr48-1rhgjmzu1z8bbqwGw0nA3GdN0sgT5lQen-WdLoRMeW2zST1iGx6VsZMckCnskwY0xJLWykRdIXkeWY_2RcKc15GpoojPaglRe52QfCEx1YFxWmEptZMSlMliphEu38cOcP8ANo45IsXmo5jMV6NQ7_fnwEW7glNX3vGFrla2VOHI6X8tRv4Ce8pKC-
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKGWAC1CLeeGDEbdLETjJCadVCW1UioG6VnwgVJYgmQvDr8TkFBGJgszzYll_f-XzfdwidBVQpxSQlSlBDQuVRIuxOIqA1F9odxbV0Ub4TNrgLr2d0VkPnX1wYrbULPtMtKLq_fJXLElxl7SgOAP7W0DoFMm7F1lopMfpe0u7dTsdArvLhg4H9yJfi4KK_hcafHVVRIotWWYiWfP-lwfjfkWyj5jcxD0-_IGcH1XTWQPfDjFibEbIN4StecDwF1zg0gK1Fii-Wb5kECVz7xsfdvAQK7gPODU75ckEuLYop7E64ixrC4D03T_nrsonSfi_tDsgqXQJ5TLzC3lpCq0iFhpk4EcZise4YEVBBPZYIP6FUCsVNoHjU4YFh4AGlTErFWOzrwA92UT3LM72HMIuUZ-y7MBaQzooKrpNYch0pa4lbi4DtowZMyfy5EsSYr2bj4O_qU7QxSMej-Wg4uTlEm7A8FZnvCNWLl1IfW1QvxIlbzA9MRKQG
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+Second+International+Workshop+on+Extreme+Scale+Programming+Models+and+Middlewar+%28ESPM2%29&rft.atitle=In-Staging+Data+Placement+for+Asynchronous+Coupling+of+Task-Based+Scientific+Workflows&rft.au=Qian+Sun&rft.au=Romanus%2C+Melissa&rft.au=Tong+Jin&rft.au=Hongfeng+Yu&rft.date=2016-11-01&rft.pub=IEEE&rft.spage=2&rft.epage=9&rft_id=info:doi/10.1109%2FESPM2.2016.006&rft.externalDocID=7831554