DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not com...

Full description

Saved in:
Bibliographic Details
Main Authors Kim, Seah, Kwon, Hyoukjun, Song, Jinook, Jo, Jihyuck, Chen, Yu-Hsin, Lai, Liangzhen, Chandra, Vikas
Format Journal Article
LanguageEnglish
Published 06.12.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not completely predictable, unlike traditional ML workloads. In addition, RTMM workloads require real-time processing, involve highly heterogeneous models, and target resource-constrained devices. Under such circumstances, developing an effective scheduler gains more importance to better utilize underlying hardware considering the unique characteristics of RTMM workloads. Therefore, we propose a new scheduler, DREAM, which effectively handles various dynamicity in RTMM workloads targeting multi-accelerator systems. DREAM quantifies the unique requirements for RTMM workloads and utilizes the quantified scores to drive scheduling decisions, considering the current system load and other inference jobs on different models and input frames. DREAM utilizes tunable parameters that provide fast and effective adaptivity to dynamic workload changes. In our evaluation of five scenarios of RTMM workload, DREAM reduces the overall UXCost, which is an equivalent metric of the energy-delay product (EDP) for RTMM defined in the paper, by 32.2% and 50.0% in the geometric mean (up to 80.8% and 97.6%) compared to state-of-the-art baselines, which shows the efficacy of our scheduling methodology.
AbstractList Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not completely predictable, unlike traditional ML workloads. In addition, RTMM workloads require real-time processing, involve highly heterogeneous models, and target resource-constrained devices. Under such circumstances, developing an effective scheduler gains more importance to better utilize underlying hardware considering the unique characteristics of RTMM workloads. Therefore, we propose a new scheduler, DREAM, which effectively handles various dynamicity in RTMM workloads targeting multi-accelerator systems. DREAM quantifies the unique requirements for RTMM workloads and utilizes the quantified scores to drive scheduling decisions, considering the current system load and other inference jobs on different models and input frames. DREAM utilizes tunable parameters that provide fast and effective adaptivity to dynamic workload changes. In our evaluation of five scenarios of RTMM workload, DREAM reduces the overall UXCost, which is an equivalent metric of the energy-delay product (EDP) for RTMM defined in the paper, by 32.2% and 50.0% in the geometric mean (up to 80.8% and 97.6%) compared to state-of-the-art baselines, which shows the efficacy of our scheduling methodology.
Author Kwon, Hyoukjun
Kim, Seah
Song, Jinook
Chandra, Vikas
Lai, Liangzhen
Chen, Yu-Hsin
Jo, Jihyuck
Author_xml – sequence: 1
  givenname: Seah
  surname: Kim
  fullname: Kim, Seah
– sequence: 2
  givenname: Hyoukjun
  surname: Kwon
  fullname: Kwon, Hyoukjun
– sequence: 3
  givenname: Jinook
  surname: Song
  fullname: Song, Jinook
– sequence: 4
  givenname: Jihyuck
  surname: Jo
  fullname: Jo, Jihyuck
– sequence: 5
  givenname: Yu-Hsin
  surname: Chen
  fullname: Chen, Yu-Hsin
– sequence: 6
  givenname: Liangzhen
  surname: Lai
  fullname: Lai, Liangzhen
– sequence: 7
  givenname: Vikas
  surname: Chandra
  fullname: Chandra, Vikas
BackLink https://doi.org/10.48550/arXiv.2212.03414$$DView paper in arXiv
BookMark eNo9z81OAjEYheEudIHIBbiiN9Cxv3TG3QRQSWZCAiYsJx_9iY2dqSlg5O5VNK7e5CxO8tygqyENDqE7RgtZKkXvIX-Gj4JzxgsqJJMjtFpslnX7gGu8OA_QB4O35tXZU3QZ-5T_142DSI6hd7g9xWMgfbIu4rbBu5TfYgJ7uEXXHuLBTf46RtvH5cv8mTTrp9W8bgjMtCR7rrUuja3UHrgRgiorPGWeg_6uU6YqOaMglSoVBUUNtZ5VhnFgUohKjNH09_VC6d5z6CGfux9SdyGJL80VRdI
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2212.03414
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2212_03414
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a674-b27778cd95ba2c3305d3f01f2a73f0e5c98210a455850a50c0df19c12a143393
IEDL.DBID GOX
IngestDate Mon Jan 08 05:50:12 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a674-b27778cd95ba2c3305d3f01f2a73f0e5c98210a455850a50c0df19c12a143393
OpenAccessLink https://arxiv.org/abs/2212.03414
ParticipantIDs arxiv_primary_2212_03414
PublicationCentury 2000
PublicationDate 2022-12-06
PublicationDateYYYYMMDD 2022-12-06
PublicationDate_xml – month: 12
  year: 2022
  text: 2022-12-06
  day: 06
PublicationDecade 2020
PublicationYear 2022
Score 1.864421
SecondaryResourceType preprint
Snippet Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Distributed, Parallel, and Cluster Computing
Computer Science - Learning
Title DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads
URI https://arxiv.org/abs/2212.03414
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEA61Jy-iqNQnOXgN7uaxD2-Lba3iKrQKe1tmkywIZSvbVvz5TrLr4-IpMJlLvhDmy2QyHyFXEEKEkccw4x4JpQRgqTI1w2hXCZlYXnvRvvwpmr3Kh0IVA0K__8JA-_n20fUHrtbXnLtUnfBK1Tucu5Ktu-eie5z0rbh6_18_5Jje9CdITPfJXs_uaNZtxwEZ2OaQ3I_nkyy_oRkdd_LvdIFIme3SthQp4491jpyNOa136n_FMi9SQ_NH6hLayxWY9RFZTCcvtzPWKxgwiGLJKh7HcaJNqirgWuDRMqIOwppDjKNVOk3wxgVSIWcPQAU6MHWY6pADshiRimMybFaNHRGKRCCpRaScRCcCGjohAg0yFVYroWJ7QkZ-2eV716OidIiUHpHT_6fOyC531fyuOiM6J8NNu7UXGGM31aUH-gv0YHh2
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=DREAM%3A+A+Dynamic+Scheduler+for+Dynamic+Real-time+Multi-model+ML+Workloads&rft.au=Kim%2C+Seah&rft.au=Kwon%2C+Hyoukjun&rft.au=Song%2C+Jinook&rft.au=Jo%2C+Jihyuck&rft.date=2022-12-06&rft_id=info:doi/10.48550%2Farxiv.2212.03414&rft.externalDocID=2212_03414