DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads
Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not com...
Saved in:
Main Authors | , , , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
06.12.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone
control involve dynamic behaviors in various granularity; task, model, and
layers within a model. Such dynamic behaviors introduce new challenges to the
system software in an ML system since the overall system load is not completely
predictable, unlike traditional ML workloads. In addition, RTMM workloads
require real-time processing, involve highly heterogeneous models, and target
resource-constrained devices. Under such circumstances, developing an effective
scheduler gains more importance to better utilize underlying hardware
considering the unique characteristics of RTMM workloads. Therefore, we propose
a new scheduler, DREAM, which effectively handles various dynamicity in RTMM
workloads targeting multi-accelerator systems. DREAM quantifies the unique
requirements for RTMM workloads and utilizes the quantified scores to drive
scheduling decisions, considering the current system load and other inference
jobs on different models and input frames. DREAM utilizes tunable parameters
that provide fast and effective adaptivity to dynamic workload changes. In our
evaluation of five scenarios of RTMM workload, DREAM reduces the overall
UXCost, which is an equivalent metric of the energy-delay product (EDP) for
RTMM defined in the paper, by 32.2% and 50.0% in the geometric mean (up to
80.8% and 97.6%) compared to state-of-the-art baselines, which shows the
efficacy of our scheduling methodology. |
---|---|
AbstractList | Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone
control involve dynamic behaviors in various granularity; task, model, and
layers within a model. Such dynamic behaviors introduce new challenges to the
system software in an ML system since the overall system load is not completely
predictable, unlike traditional ML workloads. In addition, RTMM workloads
require real-time processing, involve highly heterogeneous models, and target
resource-constrained devices. Under such circumstances, developing an effective
scheduler gains more importance to better utilize underlying hardware
considering the unique characteristics of RTMM workloads. Therefore, we propose
a new scheduler, DREAM, which effectively handles various dynamicity in RTMM
workloads targeting multi-accelerator systems. DREAM quantifies the unique
requirements for RTMM workloads and utilizes the quantified scores to drive
scheduling decisions, considering the current system load and other inference
jobs on different models and input frames. DREAM utilizes tunable parameters
that provide fast and effective adaptivity to dynamic workload changes. In our
evaluation of five scenarios of RTMM workload, DREAM reduces the overall
UXCost, which is an equivalent metric of the energy-delay product (EDP) for
RTMM defined in the paper, by 32.2% and 50.0% in the geometric mean (up to
80.8% and 97.6%) compared to state-of-the-art baselines, which shows the
efficacy of our scheduling methodology. |
Author | Kwon, Hyoukjun Kim, Seah Song, Jinook Chandra, Vikas Lai, Liangzhen Chen, Yu-Hsin Jo, Jihyuck |
Author_xml | – sequence: 1 givenname: Seah surname: Kim fullname: Kim, Seah – sequence: 2 givenname: Hyoukjun surname: Kwon fullname: Kwon, Hyoukjun – sequence: 3 givenname: Jinook surname: Song fullname: Song, Jinook – sequence: 4 givenname: Jihyuck surname: Jo fullname: Jo, Jihyuck – sequence: 5 givenname: Yu-Hsin surname: Chen fullname: Chen, Yu-Hsin – sequence: 6 givenname: Liangzhen surname: Lai fullname: Lai, Liangzhen – sequence: 7 givenname: Vikas surname: Chandra fullname: Chandra, Vikas |
BackLink | https://doi.org/10.48550/arXiv.2212.03414$$DView paper in arXiv |
BookMark | eNo9z81OAjEYheEudIHIBbiiN9Cxv3TG3QRQSWZCAiYsJx_9iY2dqSlg5O5VNK7e5CxO8tygqyENDqE7RgtZKkXvIX-Gj4JzxgsqJJMjtFpslnX7gGu8OA_QB4O35tXZU3QZ-5T_142DSI6hd7g9xWMgfbIu4rbBu5TfYgJ7uEXXHuLBTf46RtvH5cv8mTTrp9W8bgjMtCR7rrUuja3UHrgRgiorPGWeg_6uU6YqOaMglSoVBUUNtZ5VhnFgUohKjNH09_VC6d5z6CGfux9SdyGJL80VRdI |
ContentType | Journal Article |
Copyright | http://creativecommons.org/licenses/by/4.0 |
Copyright_xml | – notice: http://creativecommons.org/licenses/by/4.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2212.03414 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2212_03414 |
GroupedDBID | AKY GOX |
ID | FETCH-LOGICAL-a674-b27778cd95ba2c3305d3f01f2a73f0e5c98210a455850a50c0df19c12a143393 |
IEDL.DBID | GOX |
IngestDate | Mon Jan 08 05:50:12 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a674-b27778cd95ba2c3305d3f01f2a73f0e5c98210a455850a50c0df19c12a143393 |
OpenAccessLink | https://arxiv.org/abs/2212.03414 |
ParticipantIDs | arxiv_primary_2212_03414 |
PublicationCentury | 2000 |
PublicationDate | 2022-12-06 |
PublicationDateYYYYMMDD | 2022-12-06 |
PublicationDate_xml | – month: 12 year: 2022 text: 2022-12-06 day: 06 |
PublicationDecade | 2020 |
PublicationYear | 2022 |
Score | 1.864421 |
SecondaryResourceType | preprint |
Snippet | Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone
control involve dynamic behaviors in various granularity; task, model, and
layers... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Distributed, Parallel, and Cluster Computing Computer Science - Learning |
Title | DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads |
URI | https://arxiv.org/abs/2212.03414 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEA61Jy-iqNQnOXgN7uaxD2-Lba3iKrQKe1tmkywIZSvbVvz5TrLr4-IpMJlLvhDmy2QyHyFXEEKEkccw4x4JpQRgqTI1w2hXCZlYXnvRvvwpmr3Kh0IVA0K__8JA-_n20fUHrtbXnLtUnfBK1Tucu5Ktu-eie5z0rbh6_18_5Jje9CdITPfJXs_uaNZtxwEZ2OaQ3I_nkyy_oRkdd_LvdIFIme3SthQp4491jpyNOa136n_FMi9SQ_NH6hLayxWY9RFZTCcvtzPWKxgwiGLJKh7HcaJNqirgWuDRMqIOwppDjKNVOk3wxgVSIWcPQAU6MHWY6pADshiRimMybFaNHRGKRCCpRaScRCcCGjohAg0yFVYroWJ7QkZ-2eV716OidIiUHpHT_6fOyC531fyuOiM6J8NNu7UXGGM31aUH-gv0YHh2 |
link.rule.ids | 228,230,783,888 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=DREAM%3A+A+Dynamic+Scheduler+for+Dynamic+Real-time+Multi-model+ML+Workloads&rft.au=Kim%2C+Seah&rft.au=Kwon%2C+Hyoukjun&rft.au=Song%2C+Jinook&rft.au=Jo%2C+Jihyuck&rft.date=2022-12-06&rft_id=info:doi/10.48550%2Farxiv.2212.03414&rft.externalDocID=2212_03414 |