Portfolio Scheduling for Managing Operational and Disaster-Recovery Risks in Virtualized Datacenters Hosting Business-Critical Workloads

Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work,...

Full description

Saved in:
Bibliographic Details
Published in2019 18th International Symposium on Parallel and Distributed Computing (ISPDC) pp. 94 - 102
Main Authors van Beek, Vincent, Oikonomou, Giorgos, Iosup, Alexandru
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2019
Subjects
Online AccessGet full text
DOI10.1109/ISPDC.2019.00022

Cover

Abstract Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work, we propose an approach to manage the risk of not meeting SLAs. Our approach is based on portfolio scheduling, which is an online scheduling technique that dynamically selects a scheduling algorithm from a set (portfolio), subject to a possibly changing utility function. Ours is the first datacenter-scheduling approach to consider operational and disaster-recovery risks. Using trace-based simulation with traces collected from a commercial multi-datacenter environment, we give evidence that portfolio scheduling is able to mitigate risks significantly better than its constituent scheduling algorithms and better than datacenter engineers.
AbstractList Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation, requiring efficient online scheduling of workloads with unprecedented characteristics under strict service level agreements (SLAs). In this work, we propose an approach to manage the risk of not meeting SLAs. Our approach is based on portfolio scheduling, which is an online scheduling technique that dynamically selects a scheduling algorithm from a set (portfolio), subject to a possibly changing utility function. Ours is the first datacenter-scheduling approach to consider operational and disaster-recovery risks. Using trace-based simulation with traces collected from a commercial multi-datacenter environment, we give evidence that portfolio scheduling is able to mitigate risks significantly better than its constituent scheduling algorithms and better than datacenter engineers.
Author Iosup, Alexandru
Oikonomou, Giorgos
van Beek, Vincent
Author_xml – sequence: 1
  givenname: Vincent
  surname: van Beek
  fullname: van Beek, Vincent
  organization: Solvinity, Delft University of Technology
– sequence: 2
  givenname: Giorgos
  surname: Oikonomou
  fullname: Oikonomou, Giorgos
  organization: Delft University of Technology
– sequence: 3
  givenname: Alexandru
  surname: Iosup
  fullname: Iosup, Alexandru
  organization: VU Amsterdam
BookMark eNotzM1OAjEYheGa6ELRvYmb3sBg25nptEsdVEgwEPBnSb6ZfoMNY0vaYoJX4GUL0dXJSd48F-TUeYeEXHM25Jzp28lyPqqHgnE9ZIwJcUKudKV4JRTPFePlOfmZ-5A631tPl-0Hml1v3Zp2PtBncLA-ntkWAyTrHfQUnKEjGyEmDNkCW_-FYU8XNm4itY6-2ZB20NtvPGSQoEV3CCMd-5iO1P0uWocxZnWwybYH8N2HTe_BxEty1kEf8ep_B-T18eGlHmfT2dOkvptmVvAqZUIJIzk3heSN0A1gmzdS5sw0SklZckDWcdRlh4VigFIWpmMoUJkKi5JBPiA3f65FxNU22E8I-5WqNNOFzn8Bw0ViNA
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISPDC.2019.00022
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781728138015
1728138019
EndPage 102
ExternalDocumentID 8790949
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i217t-282d611d461b29baec3b6630db886651ae0f1e95fe480ae664df0e2e8d7e450a3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:39:04 EDT 2023
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i217t-282d611d461b29baec3b6630db886651ae0f1e95fe480ae664df0e2e8d7e450a3
OpenAccessLink http://www.scopus.com/inward/record.url?scp=85071510648&partnerID=8YFLogxK
PageCount 9
ParticipantIDs ieee_primary_8790949
PublicationCentury 2000
PublicationDate 2019-Jun
PublicationDateYYYYMMDD 2019-06-01
PublicationDate_xml – month: 06
  year: 2019
  text: 2019-Jun
PublicationDecade 2010
PublicationTitle 2019 18th International Symposium on Parallel and Distributed Computing (ISPDC)
PublicationTitleAbbrev ISPDC
PublicationYear 2019
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.6986378
Snippet Cloud datacenters are increasingly hosting business workloads. Such long-running, on-demand workloads raise important challenges in datacenter operation,...
SourceID ieee
SourceType Publisher
StartPage 94
SubjectTerms Datacenter Resource Management
Disaster Recoverability Risk
Dynamic scheduling
Monitoring
Operational Risk
Optimization
Portfolio Scheduling
Portfolios
Reliability
Resource management
Risk Management
Risk Tolerance
Title Portfolio Scheduling for Managing Operational and Disaster-Recovery Risks in Virtualized Datacenters Hosting Business-Critical Workloads
URI https://ieeexplore.ieee.org/document/8790949
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NTwIxEG2Qkyc1YPxODx5d6H6yPYMETVAiYriR6XbWbCC7hF0O8gv82XZYFo3x4K1pmuym03TmTd-8YewWYwmOcgJLa9s3AAU7FkQuWEJJUDo0cAyoOHn4FAwm3uPUn9bY3b4WBhG35DNs0XD7lq-zaE2psnbYkQaNyAN2YI5ZWatVvTwK2X4Yj3pdImuRAqWgbrg_-qVs3UX_iA2rD5UskXlrXahWtPmlwfjfPzlmze_CPD7au5wTVsO0wT6JDRpniyTjY2MCTdzyd26CUV41IeLPS1ztsn4cUs17SQ6kkGAR-jSH-YO_JPk850nK35IVlZQkGzTLoABib5oQkQ-ynBjSvGLKW1WTBE759kUGOm-ySf_-tTuwdg0WrMQgkcIycEsHtq29wFaOVICRq0wEIrQKSQbPBhSxjdKP0QsFYBB4OhboYKg76PkC3FNWT7MUzxinu8FMA4SB8mQkQ_BdHdtuiI7wPB2dswbt4mxZamjMdht48ff0JTskO5aUrCtWL1ZrvDbOv1A3W6t_ARaFtek
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELWgDDABAsQ3HhhJcRInjeeWqgVaEC2IrTrHFxSBkqpJB_oL-Nn42gYQYmCLrEiJbMt37_zuPcbOMVHgaS90jHEDC1Cw4UDsgyO0Am0iC8eAmpN7_bDzKK-fg-cVdvHVC4OIc_IZ1ulxfpdv8nhKpbLLqKEsGlGrbM3GfRksurWqu0ehLruD-1aT6FqkQSnID_eHY8o8YLQ3Wa_61IIn8lqflroez36pMP73X7bY7ndrHr__CjrbbAWzHfZBfNAkf0tzPrCLYIhd_sJtOsorGyJ-N8bJsu7HITO8lRZAGgkO4U-7nd_5Q1q8FjzN-FM6oaaSdIb2NSiB-Js2SeSdvCCONK-48k5lk8Cp4v6Wgyl22WP7atjsOEuLBSe1WKR0LOAyoesaGbraUxow9rXNQYTREQnhuYAicVEFCcpIAIahNIlADyPTQBkI8PdYLcsz3GecTgc7DBCFWqpYRRD4JnH9CD0hpYkP2A7N4mi8UNEYLSfw8O_hM7beGfZuR7fd_s0R26A1XRC0jlmtnEzxxKYCpT6d74BPwwe5Ng
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2019+18th+International+Symposium+on+Parallel+and+Distributed+Computing+%28ISPDC%29&rft.atitle=Portfolio+Scheduling+for+Managing+Operational+and+Disaster-Recovery+Risks+in+Virtualized+Datacenters+Hosting+Business-Critical+Workloads&rft.au=van+Beek%2C+Vincent&rft.au=Oikonomou%2C+Giorgos&rft.au=Iosup%2C+Alexandru&rft.date=2019-06-01&rft.pub=IEEE&rft.spage=94&rft.epage=102&rft_id=info:doi/10.1109%2FISPDC.2019.00022&rft.externalDocID=8790949