Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints

Bibliographic Details
Main Authors Perotin, Lucas; Sun, Hongyang; Raghavan, Padma
Format Journal Article
Language English
Published 13.06.2021
Subjects
DOI 10.48550/arxiv.2106.07059

Abstract The scheduling literature has traditionally focused on a single type of resource (e.g., computing nodes). However, scientific applications in modern High-Performance Computing (HPC) systems process large amounts of data, hence have diverse requirements on different types of resources (e.g., cores, cache, memory, I/O). All of these resources could potentially be exploited by the runtime scheduler to improve the application performance. In this paper, we study multi-resource scheduling to minimize the makespan of computational workflows comprised of parallel jobs subject to precedence constraints. The jobs are assumed to be moldable, allowing the scheduler to flexibly select a variable set of resources before execution. We propose a multi-resource, list-based scheduling algorithm, and prove that, on a system with $d$ types of schedulable resources, our algorithm achieves an approximation ratio of $1.619d+2.545\sqrt{d}+1$ for any $d$, and a ratio of $d+O(\sqrt[3]{d^2})$ for large $d$. We also present improved results for independent jobs and for jobs with special precedence constraints (e.g., series-parallel graphs and trees). Finally, we prove a lower bound of $d$ on the approximation ratio of any list scheduling scheme with local priority considerations. To the best of our knowledge, these are the first approximation results for moldable workflows with multiple resource requirements.
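As a quick illustration of the bound quoted in the abstract, the following Python sketch evaluates the stated approximation ratio $1.619d+2.545\sqrt{d}+1$ for a few values of $d$ and compares it with the stated lower bound of $d$. The function names are hypothetical and do not come from the paper; the constants are copied from the abstract as printed, and the sketch says nothing about how the algorithm itself works.

```python
import math


def approx_ratio_upper_bound(d: int) -> float:
    """Evaluate 1.619*d + 2.545*sqrt(d) + 1, the ratio stated in the abstract.

    Hypothetical helper for illustration only; d is the number of
    schedulable resource types and must be a positive integer.
    """
    if d < 1:
        raise ValueError("d must be a positive integer")
    return 1.619 * d + 2.545 * math.sqrt(d) + 1


def list_scheduling_lower_bound(d: int) -> float:
    """Lower bound of d stated in the abstract for list scheduling
    schemes with local priority considerations (hypothetical helper)."""
    if d < 1:
        raise ValueError("d must be a positive integer")
    return float(d)


if __name__ == "__main__":
    # Tabulate the stated upper bound next to the stated lower bound.
    for d in (1, 2, 4, 9, 16):
        upper = approx_ratio_upper_bound(d)
        lower = list_scheduling_lower_bound(d)
        print(f"d={d:2d}  upper={upper:.3f}  lower={lower:.1f}")
```

For a single resource type ($d=1$) the expression evaluates to $1.619+2.545+1=5.164$, and the gap to the lower bound of $d$ grows roughly linearly in $d$ under this general-purpose bound.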
Author Perotin, Lucas
Raghavan, Padma
Sun, Hongyang
BackLink https://doi.org/10.48550/arXiv.2106.07059 (View paper in arXiv)
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
DatabaseName arXiv Computer Science
arXiv.org
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
OpenAccessLink https://arxiv.org/abs/2106.07059
PublicationDate 2021-06-13
SecondaryResourceType preprint
SourceType Open Access Repository
SubjectTerms Computer Science - Data Structures and Algorithms
Computer Science - Distributed, Parallel, and Cluster Computing
URI https://arxiv.org/abs/2106.07059