Portfolio Scheduling for Managing Operational and Disaster-Recovery Risks in Virtualized Datacenters Hosting Business-Critical Workloads
Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work,...
Saved in:
Published in | 2019 18th International Symposium on Parallel and Distributed Computing (ISPDC) pp. 94 - 102 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.06.2019
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/ISPDC.2019.00022 |
Cover
Abstract | Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work, we propose an approach to manage the risk of not meeting SLAs. Our approach is based on portfolio scheduling, which is an online scheduling technique that dynamically selects a scheduling algorithm from a set (portfolio), subject to a possibly changing utility function. Ours is the first datacenter-scheduling approach to consider operational and disaster-recovery risks. Using trace-based simulation with traces collected from a commercial multi-datacenter environment, we give evidence that portfolio scheduling is able to mitigate risks significantly better than its constituent scheduling algorithms and better than datacenter engineers. |
---|---|
AbstractList | Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work, we propose an approach to manage the risk of not meeting SLAs. Our approach is based on portfolio scheduling, which is an online scheduling technique that dynamically selects a scheduling algorithm from a set (portfolio), subject to a possibly changing utility function. Ours is the first datacenter-scheduling approach to consider operational and disaster-recovery risks. Using trace-based simulation with traces collected from a commercial multi-datacenter environment, we give evidence that portfolio scheduling is able to mitigate risks significantly better than its constituent scheduling algorithms and better than datacenter engineers. |
Author | Iosup, Alexandru Oikonomou, Giorgos van Beek, Vincent |
Author_xml | – sequence: 1 givenname: Vincent surname: van Beek fullname: van Beek, Vincent organization: Solvinity, Delft University of Technology – sequence: 2 givenname: Giorgos surname: Oikonomou fullname: Oikonomou, Giorgos organization: Delft University of Technology – sequence: 3 givenname: Alexandru surname: Iosup fullname: Iosup, Alexandru organization: VU Amsterdam |
BookMark | eNotzM1OAjEYheGa6ELRvYmb3sBg25nptEsdVEgwEPBnSb6ZfoMNY0vaYoJX4GUL0dXJSd48F-TUeYeEXHM25Jzp28lyPqqHgnE9ZIwJcUKudKV4JRTPFePlOfmZ-5A631tPl-0Hml1v3Zp2PtBncLA-ntkWAyTrHfQUnKEjGyEmDNkCW_-FYU8XNm4itY6-2ZB20NtvPGSQoEV3CCMd-5iO1P0uWocxZnWwybYH8N2HTe_BxEty1kEf8ep_B-T18eGlHmfT2dOkvptmVvAqZUIJIzk3heSN0A1gmzdS5sw0SklZckDWcdRlh4VigFIWpmMoUJkKi5JBPiA3f65FxNU22E8I-5WqNNOFzn8Bw0ViNA |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ISPDC.2019.00022 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9781728138015 1728138019 |
EndPage | 102 |
ExternalDocumentID | 8790949 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i217t-282d611d461b29baec3b6630db886651ae0f1e95fe480ae664df0e2e8d7e450a3 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:39:04 EDT 2023 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i217t-282d611d461b29baec3b6630db886651ae0f1e95fe480ae664df0e2e8d7e450a3 |
OpenAccessLink | http://www.scopus.com/inward/record.url?scp=85071510648&partnerID=8YFLogxK |
PageCount | 9 |
ParticipantIDs | ieee_primary_8790949 |
PublicationCentury | 2000 |
PublicationDate | 2019-Jun |
PublicationDateYYYYMMDD | 2019-06-01 |
PublicationDate_xml | – month: 06 year: 2019 text: 2019-Jun |
PublicationDecade | 2010 |
PublicationTitle | 2019 18th International Symposium on Parallel and Distributed Computing (ISPDC) |
PublicationTitleAbbrev | ISPDC |
PublicationYear | 2019 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.6986378 |
Snippet | Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation,... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 94 |
SubjectTerms | Datacenter Resource Management Disaster Recoverability Risk Dynamic scheduling Monitoring Operational Risk Optimization Portfolio Scheduling Portfolios Reliability Resource management Risk Management Risk Tolerance |
Title | Portfolio Scheduling for Managing Operational and Disaster-Recovery Risks in Virtualized Datacenters Hosting Business-Critical Workloads |
URI | https://ieeexplore.ieee.org/document/8790949 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NTwIxEG2Qkyc1YPxODx5d6H6yPYMETVAiYriR6XbWbCC7hF0O8gv82XZYFo3x4K1pmuym03TmTd-8YewWYwmOcgJLa9s3AAU7FkQuWEJJUDo0cAyoOHn4FAwm3uPUn9bY3b4WBhG35DNs0XD7lq-zaE2psnbYkQaNyAN2YI5ZWatVvTwK2X4Yj3pdImuRAqWgbrg_-qVs3UX_iA2rD5UskXlrXahWtPmlwfjfPzlmze_CPD7au5wTVsO0wT6JDRpniyTjY2MCTdzyd26CUV41IeLPS1ztsn4cUs17SQ6kkGAR-jSH-YO_JPk850nK35IVlZQkGzTLoABib5oQkQ-ynBjSvGLKW1WTBE759kUGOm-ySf_-tTuwdg0WrMQgkcIycEsHtq29wFaOVICRq0wEIrQKSQbPBhSxjdKP0QsFYBB4OhboYKg76PkC3FNWT7MUzxinu8FMA4SB8mQkQ_BdHdtuiI7wPB2dswbt4mxZamjMdht48ff0JTskO5aUrCtWL1ZrvDbOv1A3W6t_ARaFtek |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELWgDDABAsQ3HhhJcRInjeeWqgVaEC2IrTrHFxSBkqpJB_oL-Nn42gYQYmCLrEiJbMt37_zuPcbOMVHgaS90jHEDC1Cw4UDsgyO0Am0iC8eAmpN7_bDzKK-fg-cVdvHVC4OIc_IZ1ulxfpdv8nhKpbLLqKEsGlGrbM3GfRksurWqu0ehLruD-1aT6FqkQSnID_eHY8o8YLQ3Wa_61IIn8lqflroez36pMP73X7bY7ndrHr__CjrbbAWzHfZBfNAkf0tzPrCLYIhd_sJtOsorGyJ-N8bJsu7HITO8lRZAGgkO4U-7nd_5Q1q8FjzN-FM6oaaSdIb2NSiB-Js2SeSdvCCONK-48k5lk8Cp4v6Wgyl22WP7atjsOEuLBSe1WKR0LOAyoesaGbraUxow9rXNQYTREQnhuYAicVEFCcpIAIahNIlADyPTQBkI8PdYLcsz3GecTgc7DBCFWqpYRRD4JnH9CD0hpYkP2A7N4mi8UNEYLSfw8O_hM7beGfZuR7fd_s0R26A1XRC0jlmtnEzxxKYCpT6d74BPwwe5Ng |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2019+18th+International+Symposium+on+Parallel+and+Distributed+Computing+%28ISPDC%29&rft.atitle=Portfolio+Scheduling+for+Managing+Operational+and+Disaster-Recovery+Risks+in+Virtualized+Datacenters+Hosting+Business-Critical+Workloads&rft.au=van+Beek%2C+Vincent&rft.au=Oikonomou%2C+Giorgos&rft.au=Iosup%2C+Alexandru&rft.date=2019-06-01&rft.pub=IEEE&rft.spage=94&rft.epage=102&rft_id=info:doi/10.1109%2FISPDC.2019.00022&rft.externalDocID=8790949 |